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Abstract 

We study the optimal approximation of the solution of an operator equa- 

, tion A{u) = / by four types of mappings: a) linear mappings of rank n; b) 

. n-term approximation with respect to a Riesz basis; c) approximation based 

, on linear information about the right hand side /; d) continuous mappings. 

. We consider worst case errors, where f is an element of the unit ball of a 
O . 

. Sobolev or Besov space Bg{Lp{Q)) and i7 C M is a bounded Lipschitz do- 

o 

s 

> 
X 



main; the error is always measured in the H'^-noim. The respective widths 
are the linear widths (or approximation numbers), the nonlinear widths, the 
Gelfand widths, and the manifold widths. As a technical tool, we also study 
the Bernstein numbers. Our main results are the following. If p > 2 then the 
order of convergence is the same for all four classes of approximations. In 
particular, the best linear approximations are of the same order as the best 
nonlinear ones. The best linear approximation can be quite difficult to realize 
as a numerical algorithm since the optimal Galerkin space usually depends on 
the operator and of the shape of the domain Q. For p < 2 there is a difference, 
nonlinear approximations are better than linear ones. However, in this case, it 
turns out that linear information about the right hand side / is again optimal. 
Our main theoretical tool is the best n-term approximation with respect to 
an optimal Riesz basis and related nonlinear widths. These general results are 
used to study the Poisson equation in a polygonal domain. It turns out that 
best n-term wavelet approximation is (almost) optimal. The main results of 
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this paper arc about approximation, not about computation. However, we also 
discuss consequences of the results for the numerical complexity of operator 
equations. 

AMS subject classification: 41A25, 41A46, 41A65, 42C40, 65C99 

Key Words: Elliptic operator equations, worst case error, linear and nonlinear 
approximation methods, best n-term approximation, Besov spaces, Gelfand widths, 
Bernstein widths, manifold widths. 

1 Introduction 

We study the optimal approximation of the solution of an operator equation 

(1) A{u) = /, 
where ^ is a linear operator 

(2) A-.H^G 

from a Hilbert space H to another Hilbert space G. We always assume that A is 
boundcdly invertible, and so (1) has a unique solution for any f E G. We have in 
mind the more specific situation of an elliptic operator equation which is given as 
follows. Assume that Q C R'' is a bounded Lipschitz domain and assume that 

(3) A : H^{n) H-'{n) 

is an isomorphism, where s > 0. (For the definition of the Sobolev spaces Hq{Q) 
and H~'^{Q), we refer to the Subsections 5.7, 5.8 and 5.9). A standard case (for 
second order elliptic boundary value problems for PDEs) is s = 1, but also other 
values of s are of interest. Now we put H = Hq{Q) and G = H~^{Q). Since A is 
boundedly invertible, the inverse mapping S : G H is well defined. This mapping 
is sometimes called the solution operator — in particular if we want to compute the 
solution u — S{f) from the given right-hand side A{u) = f. 

We use linear and (different kinds of) nonlinear mappings Sn for the approxima- 
tion of the solution u — A~^{f) for / contained in F C C We consider the worst 
case error 

(4) e(Sn,F,H)^ sup \\A-\f) - Sn(f)\\H, 

\\f\\F<l 
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where F is a normed (or quasi-normcd) subspace of G. In our main results, F will 
be a Sobolev or Besov space. ^ Hence we use the following commutative diagram 

G ^ H 

I \ / Sp 
F. 

Here I : F ^ G denotes the identity and Sp the restriction of S to F. In the specific 
case (3) this diagram is given by 

I \ / St 

where B~''^^^{Lp{Q)) denotes a Besov space compactly embedded into cf. 
the Appendix for a definition, and St the restriction of S to B~^~^^{Lp{fl)). We are 
interested in approximations that have the optimal order of convergence depending 
on n, where n denotes the degree of freedom. In general our results are construc- 
tive in a mathematical sense, because we can describe optimal approximations S'„ 
in mathematical terms. This does not mean, however, that these descriptions are 
constructive in a practical sense, since it might be very difficult to convert those 
descriptions into a practical algorithm. We will discuss this more thoroughly in Sec- 
tion 3.4. As a consequence, most of our results give optimal benchmarks and can 
serve for the evaluation of old and new algorithms. We study and compare four kinds 
of approximation methods; see Section 2.1 for details. 

• We consider the class of all continuous linear mappings : F — > if, 

n 

SM-Y.^iU)-hi 

1=1 

with arbitrary hi & H. The worst case error of optimal linear mappings is 
given by the approximation numbers or linear widths 

e'^{S,F,H)^ mi e{S^,F,H). 

-'^Formally we only deal with Besov spaces. Because of the cmbcddings B^''^^*{Lp{il)) C 
Wp~'*+*(17) C S^''+*(Lp(r2)), which hold for 1 < p < oo, t > s, see [91], our results are valid 
also for Sobolev spaces. 
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• For a given basis B of H we consider the class J\fn{B) of all (linear or nonlinear) 
mappings of the form 

n 
k=l 

where the and the depend in an arbitrary way on /. We also allow that 
the basis B to be chosen in a nearly arbitrary way. Then the nonlinear widths 
€^^{3, F, H) are given by 

cm H) = inf inf e(5„, F, H). 

Here Be denotes a set of Riesz bases for H where C indicates the stabihty of 
the basis. These numbers are the main topic of our analysis. 

• We also study methods Sn with Sn = (fin ° where : F — > M" is linear 
and continuous and : — > if is arbitrary. This is the class of all (hnear 
or nonlinear) approximations Sn that use linear information of cardinality n 
about the right hand side /. The respective widths are 

rn{S,F,H) := Me{Sn,F,H), 

they are closely related to the G elf and numbers. 

• Let Cn be the class of continuous mappings, given by arbitrary continuous 
mappings : F — > and : ^ if. Again we define the worst case 
error of optimal continuous mappings by 

er\S,F,H)=M e{Sn,F,H), 

where Sn = (pn° Nn- These numbers are called manifold widths of S. 

For problems (3) with F — B^{Lp{Q)) our main results are the following. If p > 2 
then the order of convergence is the same for all four classes of approximations. In 
particular, the best linear approximations are of the same order as the best nonlinear 
ones. The best linear approximation can be quite difficult to realize as a numerical 
algorithm since the optimal Galerkin space usually depends on the operator and 
of the shape of the domain Q. For p < 2 there is an essential difference, nonhnear 
approximations are better than linear ones. However, in this case it turns out that 
linear information about the right hand side / is optimal. Our main theoretical tool 
is best n-term approximation with respect to an optimal Riesz basis and related non- 
linear widths. The main results are about approximation, not about computation. 
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However, we also discuss consequences of the results for the numerical complexity 
of operator equations. 

The paper is organized as follows: 

1. Introduction 

2. Linear and nonlinear widths 

2.1 Classes of admissible mappings 

2.2 Properties of widths and relations between them 

3. Optimal approximation of elliptic problems 

3.1 Optimal linear approximation of elliptic problems 

3.2 Optimal nonlinear approximation of elliptic problems 

3.3 The Poisson equation 

3.4 Algorithms and complexity 

4. Proofs 

4.1 Properties of widths 

4.2 Widths of cmbeddings of weighted sequence spaces 

4.3 Widths of embeddings of Besov Spaces 

4.4 Proofs of Theorems 2, 3, and 5 

5. Appendix - Besov spaces 

We add a few comments. The main results of our paper arc contained in Section 
3.2. They are further illustrated for the case of the Poisson equation in Section 3.3. 
A discussion in connection with uniform approximation, adaptive/nonadaptive in- 
formation, adaptive numerical schemes, and complexity is contained in Section 3.4. 
All proofs are contained in Section 4. Of independent interest are the estimates of 
the widths of embedding operators for Besov spaces, see Section 4.3. 
Notation. We write a x 6 if there exists a constant c > (independent of the 
context dependent relevant parameters) such that 

c~^ a < b < ca . 

All unimportant constants will be denoted by c, sometimes with additional indices. 

2 Linear and Nonlinear Widths 

Widths represent concepts of optimality. In this section we shall discuss several 
variants. Most important for us will be the nonlinear widths ej^°° and the linear 
widths e^^. We also study Gelfand and manifold widths and, as a vehicle of the 
proofs, Bernstein widths. 
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2.1 Classes of Admissible Mappings 
Linear Mappings Sn 

Here we consider the class of all continuous linear mappings Sn '■ F H , 

n 

(5) Sn{f) = Y,Hf)h, 

1=1 

where the Lj : F — > R are linear functionals and hi are elements of H. We consider 
the worst case error 

(6) e{Sn,F,H):= sup \\A-\f) - SMIh, 

II/IIf<i 

where F is a normed (or quasi-normed) subspace of G. Accordingly, we seek the 
optimal linear approximation, as well as the numbers 

(7) e'^{S,F,H)= mi e{Sn,F,H), 

usually called approximation numbers or linear widths oi S : F ^ H, ci. [60, 72, 73, 
85]. 

Nonlineeir Mappings Sn 

Let B — {hi, h2, . . .} be a subset of H. Then the best n-term approximation of an 
element u E H with respect to this set B is defined as 



(8) an.{u,B)H ■■= . inf inf 

,...,tn Ci 5... Cn 



hi 



k=l 



H 



This subject is widely studied, see the surveys [29] and [84]. Now we continue by 
looking for an optimal set B as has been done in Kashin [54] , Donoho [38] , Temlyakov 
[82, 83, 84] and DeVore, Petrova, and Temlyakov [33]. Temlyakov [84] suggested to 
consider the quantities 

inf sup aniu,B)H, 

where "D is a subset of the set of all bases of H. The particular case of V being 
the set of all orthonormal bases has been discussed in [82, 83], while the set of 
all unconditional, democratic bases is studied in [33]. Sec Remark 25 for a further 
discussion. In this paper we work with Riesz bases, see, e.g., Meyer [62, page 21]. 

Definition 1. Let H be a Hilbert space. Then the sequence hi, /i2, ■ ■ ■ of elements of 
H is called a Riesz basis for H if there exist positive constants A and B such that, 
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for every sequence of scalars ai, q;2, . . . with 7^ for only finitely many k, we 
have 



k k k 

and the vector space of finite sums ^akhk is dense in H. 

RemEirk 1. The constants A,B reflect the stability of the basis. Orthonormal bases 
are those with A — B — 1. Typical examples of Riesz bases are the biorthogonal 
wavelet bases on W'' or on certain Lipschitz domains, cf. Cohen [12, Sect. 2.6, 2.12]. 

In what follows 

(10) B^{hi\ieN} 

will always denote a Riesz basis of H with A and B being the corresponding optimal 
constants in (9). 

For a given basis B we consider the class NniB) of all (linear or nonlinear) mappings 
of the form 

n 

(11) 5„(/)=^Cfe/li„ 

fe=l 

where the Cfc and the ik depend in an arbitrary way on /. By the arbitrariness of Sn 
one obtains immediately 

(12) inf sup \\A-'f-SM\\H^ sup aM~'L^)H. 

SneMniB) \\fy<i \mF<i 

It is natural to assume some common stability of the bases under consideration. For 
a real number C > 1 we put 

(13) Be := [b : B/A < c}. 

We are ready to define the nonhnear widths e^°Q{S, F, H) by 

(14) e-5(5, F, H) = inf inf e{S^, F, H). 

BeBc SneAfn{B) 

These numbers are the main topic of our analysis. We call them the widths of best 
n-term approximation (with respect to the collection Be of Riesz basis of H) . 

Remcirk 2. i) It should be clear that the class Hn{B) contains many mappings 
that are difficult to compute. In particular, the number n just reflects the di- 
mension of a nonlinear manifold and has nothing to do with a computational 
cost. In this paper we also are interested in lower bounds, such lower bounds 
being strengthened if we admit a larger cass of approximations. 



1/2 
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ii) The inequality 

(15) e^^^iS,F,H)<e':riS,F,H) 
is trivial. 

(Hi) Because of the homogeneity of an, i-e., an{^u, B)h = \M^n{u,B)H, \ E M., it 
does not change the asymptotic behaviour of ej^°° if we replace sup||j||^<i by 
^^P||/IIf<c foi^ c > 0. 

Continuous Mappings Sn 

Linear mappings Sn are of the form Sn = ^Pn ° Nn where both Nn '■ F and 
ifn '■ H are hnear and continuous. If we drop the hnearity condition then 

we obtain the class of all continuous mappings C„, given by arbitrary continuous 
mappings Nn '■ F and : M"" — >■ H. Again we define the worst case error of 

optimal continuous mappings by 

(16) er\S, F, H) = inf e(5„, F, H). 

These numbers, or slightly different numbers, were studied by different authors, cf. 
[30, 31, 40, 60]. Sometimes these numbers are called manifold widths of 5*, see [31], 
and we will use this terminology here. The inequality 

(17) eT{S,F,H)<e':\S,F,H) 
is obvious. 

Gelfand Widths and Minimal Radii of Information 

We can also study methods Sn with Sn — <Pn° where : F — > IR" is linear and 
continuous and : — > is arbitrary. The respective widths are 

(18) rn{S,F,H):^mie{Sn,F,H). 

On 

These numbers are called the n-th minimal radii of information, which are closely 
related to Gelfand widths, see Lemma 1 below. The n-th Gelfand width of the linear 
operator S : F H is given by 

(19) d^(S,F,H):^ ini sup : ||/||^ < 1, L,(/) = , i = 1, . . . nj , 

Li,...,Ln (. J 

where the L,- : F ^ K are continuous linear functionals. 
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Bernstein Widths 



A well-known tool for deriving lower bounds of widths consists in the investigation 
of Bernstein widths, see [72, 73, 85]. 

Definition 2. The number bn{S, F, H), called the n-th Bernstein width of the opera- 
tor S : F ^ H, is the radius of the largest {n + l) -dimensional ball that is contained 

mS{{\\f\\F<l}). 

Renicirk 3. The literature contains several different definitions of Bernstein widths. 
For example, Pietsch [71] gives the following version. Let X„ denote subspaces of F 
of dimension n. Then 

K{S,F,H):= sup inf . 

XnCF ^^Xn,X^O \\x\\f 

As long as S is an infective mapping we obviously have bn{S, F, H) = bn+i{S, F, H). 



2.2 Properties of Widths and Relations Between Them 

Lemma 1. Let n e N and assume that F <Z G is quasi-normed. 

(i) We have <rn < 2d^ if F is normed and d"' x r„ in general. 

(ii) The inequality 

(20) bn{S, F, H) < min (e^iS, F, H), d^iS, F, H)^ 
holds for all n. 

Remeirk 4. The inequality bn < e™°* is known, compare e.g. with [30], and the proof 
technique (via Borsuk's theorem) is often used for the proof of similar results. 

The Bernstein widths bn can also be used to prove lower bounds for the e^°§. The 
following inequality has been proved in [24]. 

Lemma 2. Assume that F <Z G is quasi-normed. Then 

(21) e-:^{S,F,H) > ^bm{S,F,H) 
holds for all m > AG^n. 

More important for us will be a direct comparison of e^"^ and e^"^. Best n-term 
approximation yields a mapping 

n 
k=l 

9 



which is in general not continuous. However, it is known that certain discontinuous 
mappings can be suitably modified in order to obtain a continuous n-term approx- 
imation with an error which is only slightly worse, see, for example, [31] and [41]. 
We prove that, under general assumptions, the numbers ej^°^ can be bounded from 
below by the manifold widths ej^""*. 

Theorem 1. Let S : G ^ H be an isomorphism. Suppose that the embedding 
F ^ G is compact. Then for all C > 1 and all n we have 

(22) eZliiS, F,H)<2C\\S \\' \\ \\' e^^^{S, F, H) . 
Finally we collect some further properties of the quantities 6^°"^* and e°°°. 

Lemma 3. (i) Let m,n ^ N, and let F be a subset of the quasi-normed linear space 
X , where X itself is a subset of the quasi-normed linear space Y. Let Ij denote 
embedding operators. Then 

(23) e;,tn(^i, F, Y) < er (/2, F, X) e^^h, X, Y) 
holds. 

(ii) Let F be a quasi-normed subset of G and let I : F ^ G be the embedding. Then 

(24) eT{I,F,G) < \\S-'\\e^:^\S,F,H) < \\S-'\\ \\S\\ e^'il , F,G) 

and for any C > ||-S'~-^|| we have 
(25) 

en?c||5-i||||S||(-'^'-^'^) ^ ll-S""! e°°5(5',F,if) < \\S\\el°c/^\\s-m\s\\){^^F,G) . 

Remark 5. Let us point out the following which is part of the proof of Lemma 3. Let 
B = {hi, h2, ■ ■ ■} be a Riesz basis of G. Let Sn be an approximation of the identity 
I : F G. Then S{B) is a Riesz basis of H and S o Sn is an approximation of 
S : F ^ H satisfying 

(26) 11/ - Sn{f)\\G < \\S-'\\ -WSf-So Snif) \\h < \\S-'\\ • II 5 II • II / - Snif) \\g ■ 

This makes clear that if B and Sn are order optimal for the triple I, F, G, then S{B) 
and S o Sn are order optimal for the triple S,F,H. Consequently, instead of looking 
for good approximations of S : F ^ H it will be enough to study approximations of 
the embedding I : F ^ G. 
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Remark 6. The assertion in part (i) of the Lemma is essentially proved in [40] 
but traced there to Khodulev. The inequality (23) can be made more transparent by 
means of the diagram 

X ^ Y 

h \ / h 
F. 

Remark 7. The approximation numbers ej^°, the Gelfand widths d"- , the manifold 
widths e^°'^* and Bernstein widths bn are particular examples of s-numbers in the 
sense o/Pietsch [71], see [60] for the manifold widths. They have several properties 
in common. Letting Sn denote any of the numbers e"^, d", e^°"* and bn we have 

(27) Snin o Ti o To) < II To || || T2 \\ Sn{Ti) , 

where Tq e £{Eo,E), Ti e JC{E,F), T2 e C{F,Fq) and Eo,E,F,Fo are arbitrary 
Banach spaces. For these four types of s-numbers the assertion remains true also 
for quasi-Banach spaces. 

Another property concerns additivity. For s„ instead of e^™ and we have 

(28) S2n{To + T,)<c (s„(To) + s„(Ti)) , 

where Tq, Ti G C{E,F), E,F are arbitrary quasi-Banach spaces, and c does not 
depend on n,TQ,Ti, cf. [10]. In case that F is a Banach space, one can take c—1. 



3 Optimal Approximation of Elliptic Problems 

Let s,t > 0. We consider the diagram 

I \ / St 

B-^+\L,m, 

where St denotes the restriction of S to S~*+*(Lp(Q)) and / denotes the identity. 
We assume (3) and we let S — A~^. 

3.1 Optimal Linear Approximation of Elliptic Problems 

Theorem 2. Let il <zW^ be a bounded Lipschitz domain. Let < p,q < 00, s > 0, 
and 

(29) ^^''irl 
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Then 

Reiiicirk 8. i) The restriction (29) is necessary and sufficient for the compact- 
ness of the embedding I : B^^'^^{Lp{D,)) ^ cf. the Appendix, Propo- 
sition 7. 

a) The proof is constructive. First of all one has to determine a linear mapping 
Sn that approximates the embedding I : B^^^^{Lp{Q)) H~^{Q) with the op- 
timal order. How this can be done is described in Remark 28, Subsection 4.3.3. 
Finally, the linear mapping S o Sn realizes an in order optimal approximation 
of St. 

Hi) There are hundreds of references dealing with approximation numbers of linear 
operators. Most useful for us have been the monographs [43, 72, 73, 85, 81, 94], 
as well as the references contained therein. 

3.2 Optimal Nonlinear Approximation of Elliptic Problems 

To begin with, we consider the manifold and the Gelfand widths. There we have a 
rather final answer. 

Theorem 3. Let fl gM!^ be a bounded Lipschitz domain. Let < p,q < oo, s > 0, 
and 

t>d{--- 
\P 2 

Then 

er\S, 5-+*(L,(Q))), H^(n)) X n-'/' 
If, in addition, p > 1 (and t > d/2 if 1 < p < 2), then 

d^iS, R-+*(L^(Q))), H^.m X n-'/' . 



From Theorem 1 and Theorem 3 we conclude that the order of e°°^ is also at 
least n"*/*^. For the respective upper bound of the nonlinear widths e°°^ we need a 
few more restrictions with respect to the domain ft. Let Q be a bounded Lipschitz 
domain in R*^ and let s > 0. We assume that for any fixed triple {t, p, q) of parameters 
the spaces B~^^^{Lp{Vt)) and H~^{Q) allow a discretization by one common wavelet 
system B\ i.e. (107)-(112) should be satisfied with B-'+\Lp{Q)) and B^\L2{ft)), 
respectively, cf. Appendix 5.10. By assumption such a wavelet system belongs to 
Be* for some 1 < C* < oo. 
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Theorem 4. Under the above conditions on and if < p,q < oo, s > Q, t > 
d{^ — |)+, we have for any C > C* 

Remark 9. Comparing Theorems 3, 4 and Theorem 2 there is a clear message. For 
p < 2 there are nonlinear approximations that are better in order than any linear 
approximation. 

RemEirk 10. The proof of the upper bound in Theorem 4 is constructive in a the- 
oretical sense that we now describe. Given a right-hand side f e S~*+*(Lp(Jl)) we 
have to calculate all wavelet coefficients (/, ipj,x) . The sequence of these coefficients 
belongs to the space b~g'^^{'V), cf. Subsection 4.2. With 

« = («j,A)j,A , %,A — (/, i^j,x) , for all j, A , 

we find a good approximation Sn{a) of a with n components with respect to the 
norm \\ ■ \bl 2(V)|| in Proposition 2. To get an optimal approximation of the solution 
u = Sf in II ■ |if'*(f2)|| we have to apply the solution operator to Sn{a). Hence 

K 

(30) M„ = (5 o Sn)ia) =^Y1 S*,A ' 

j=0 A6A* 

where K = K{a,n), with a* ^ and A* as in Proposition 2 (cf. in particular (62) 
and (65)^, represents such a good approximation of u. To calculate Un, a lot of 
computations have to he done. The coefficients a*^ are the largest in a weighted sense 
(the weight depends on n and j , cf. the proof of Proposition 2 for explicit formulas). 
Having these coefficients at hand one has finally to solve all the equations 

(31) Auj,x^'^j,x: 0<j<K, AeA* 

to obtain Uj^x — Sipj^x- The number of equations is 0{n). 

In this way we obtain a nonlinear approximation with respect to the Riesz basis 
given by the Sipj^x- Observe that this Riesz basis depends on the operator equation. 
It would be much better to use a known Riesz basis, such as a wavelet basis, that 
does not depend on A. See Theorem 5 for a step into that direction. 

Remark 11. At least ifD, is a cube, all required properties are known to be satisfied if 
in addition 1 < p,q < oo. The latter restriction allows to use duality arguments, cf. 
Proposition 10 in Appendix 5.8. There also exist results for domains with piecewise 
analytic boundary such as polygonal or polyhedral domains. One natural way as. 
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e.g., outlined in [8] and [26], is to decompose the domain into a disjoint union of 
parametric images of reference cubes. Then, one constructs wavelet bases on the 
reference cubes and glues everything together in a judicious fashion. However, due 
to the glueing procedure, only Sobolev spaces with smoothness s < 3/2 can be 
characterized. This bottleneck can be circumvented by the approach in [27]. There, 
a much more tricky domain decomposition method involving certain projection and 
eoctension operators is used. By proceeding in this way, norm equivalences for all 
spaces i?*(Lp(r2)) can be derived, at least for the casep > 1, see [27, Theorem 3.4.3]. 
However, the authors also mention that their results can be generalized to the case 
p<l, see [27, Remark 3.1.2]. 

Sobolev and Besov spaces on compact C°° -manifolds were already characterized 
via spline bases and sequence spaces by Ciesielski and Figiel [11]. In that paper also 
the isomorphism between function spaces and sequence spaces is used to obtain results 
for various s-numbers. 

Remark 12. Comparing Theorems 3 and A we see that the numbers e°°^, 6^°"^*, and 
d"' have the same asymptotic behaviour, at least for p > 1. Using the relation rf" x r„, 
see Lemma 1, we actually can get the optimal order n^^^'^ with an approximation of 
the form 

(32) /^5o^„o7V„(/), 
where 

is linear (this mapping gives the information that is used about the right hand side), 
and 

V9„ : ^ H-'{n) 

is nonlinear. Note that neither Nn nor depend on S. The mapping ipn o Nn gives 
a good approximation of the embedding from B~^~^^{Lp{fl)) to H~^. 

Remark 13. There is a further little difference between linear and nonlinear ap- 
proximation. Let us consider the limiting case t — d{l/p — 1/2), where < p < 2. 
Then the embedding B~^^*{Lp{Q)) ^ H~^{D,) is continuous, not compact. As a 
consequence 

e|i"(5,5-+*(Lp(fi))),i^^(n))/.0 if n^oo, 

but 

ens, B;^+\Lp{m, H^m) ^ O if n^oc, 

cf. Remark 26. 
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3.3 The Poisson Equation 

The next step is to discuss the specific case of the Poisson equation on a Lipschitz 
domain Q contained in B?: 

(33) -A-u ^ f in n 

u — on dfl. 

As usual, we study (33) in the weak formulation. Then, it can be shown that the 
operator ^ = A : Hq — ^ is boundedly invertible, see, e.g., [50] for details. 
Hence Theorems 2 and 3 apply with s = 1; for the upper bound of Theorem 4 we 
need some restrictions with respect to Jl. For the proof of Theorem 4 we used the 
Riesz basis Sipj^x, which depends on A. Now we want to approximate the solution 
u by wavelets. 

We shall restrict ourselves to the case that Q is a simply connected polygonal domain. 
The segments of dQ are denoted by ri,...,r;v, where each F; is open and the 
segments are numbered in positive orientation. Furthermore, denotes the endpoint 
of Fi and cui denotes the measure of the interior angle at T^. Appropriate wavelet 
systems can be constructed for such a domain, see Remark 11. Then we obtain the 
following. 

Theorem 5. Let Q be a polygonal domain in M^. Let 1 < p <2 and let k > 1 be a 

nonnegative integer such that 

TflTT 2 

^ A; + 1 - - for all m e N, Z = 1, . . . , A^. 

P 

Then for an appropriate wavelet system B* , the best n-term approximation of prob- 
lem (33) yields 

(34) sup an(M,i3*) < c,n-'=/^+" 

||/|s^-i(Lp(n))||<i 

where £ > and do not depend on n. 

Reiiicirk 14. This approximation differs greatly from the one described in Re- 
mark 10. Here we can work with one given wavelet system to approximate the solution 
u. We are not forced to work with the solutions of the system (31). A more detailed 
discussion of these relationships, including possible numerical realizations of wavelet 
methods, will follow in Section 3.4. 
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3.4 Algorithms and Complexity 

So far, we have studied the error e(5'„, F, H) of approximations We compared 
the error of nonhnear Sn and hnear Sn and proved results on the optimal rate of 
convergence. We assume that (1) is a given fixed operator equation and hence, in 
the case of (3), also Q is fixed. 

In this section we briefly discuss algorithms and their complexity, and for simplic- 
ity we still assume that the operator equation (3) is given and fixed. Observe that 
in practice it is important to construct also algorithms for more general problems: 
We want to input information about Q and A and the right hand side /, and we 
want to obtain an e-approximation of the sohition u. In our more restricted case we 
only have to input information concerning the right hand side / because fl and A 
are fixed. 

As is usual in numerical analysis, we use the real number model of computation 
(see [64] for the details and [66] and [67] for further comments). Any algorithm com- 
putes and/or uses some information (consisting in finitely many numbers) describing 
the right hand side / of (3). There are different ways how an algorithm may use 
information concerning /, we describe two of them in turn. 

1. The information used about / is very explicit if Sn is linear (5): Then the 
algorithm uses Li(/), . . . , L„(/) and we assume that we have an oracle (or 
subroutine) for the Li{f). In practical applications the computation of a func- 
tional Li{f) can be very easy or very difficult or anything between. One often 
assumes that the cost of obtaining a value Li{f) is c where c > is small or 
large, depending on the circumstances. 

As in (11), we can imagine Sn as the input-output mapping of a numerical 
algorithm: on input / G F we obtain the output Sn{f) — Un — Yl^=i'^khi^,. 
More formally we should say that the output is 

(35) out(/) = (ii, Ci, i2, C2, . . . , in, Cn) 

but we identify out(/) with Un- Of course we cannot consider arbitrary map- 
pings Sn of the form (11) as the input-output mapping of an algorithm, since 
not all such 5'„ are computable. 

We still assume that we only have an oracle for the computation of hnear 
functionals Li{f). Then it is not so clear what the information cost of (11) is, 
since (11) only describes the (desired) output of an algorithm, it is not an algo- 
rithm by itself. We need an algorithm that uses information Li{f), . . . , L^if), 
where N might be bigger than n, to produce the ik and the of out(/). The 
information cost of such a procedure would be cN. 
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2. One also can assume that a good approximation /„ can easily be precomputed 
with negligible cost. Hence the algorithm starts with an approximation 



fc=i 

such as a best n-term approximation (or a greedy approximation) of / with 
respect to a basis {gi, : i e N}. 

This is a good place for a short remark about adaption. The use of adaptive 
methods is quite widespread but we want to stress that the notion of adaptive meth- 
ods is not uniformly used in the literature. Some confusion is almost unavoidable if 
such different notions are mixed. To avoid such confusion, we do not use the notion 
of an "adaptive method". Instead we speak first about adaptive (or nonadaptive) 
information and then about adaptive numerical schemes. 

• Nonadaptive information: The algorithm uses certain functionals Li, L2, . . ■ , 
and for each input f E F the algorithm needs Li{f), L2{f), . . . , Ln{f)- Hence 
the functionals Li do not depend on /. In this case we say that the algorithm 
uses nonadaptive information. 

• Adaptive information: The algorithm uses Li{f) and, depending on this num- 
ber, the next functional L2 is chosen. In general, the chosen functional 
may depend on the values Li{f), . . . , Lk-i{f) that are already known to the 
algorithm. Observe that cannot depend in an arbitrary way on / since the 
algorithm can only use the known information about /. In this case we say 
that the algorithm uses adaptive information. 

We give an example. Assume that a certain Sn of the form (11) can be realized 
in such a way that we first compute Li{f ), . . . , LN{f), where the Li do not depend 
on / G -F. In the latter parts of the algorithm we only use the Li{f) for the n 
largest values of \Li{f)\, together with the corresponding values of i, to compute the 
output out(/). Such an algorithm uses nonadaptive information (of cardinality A^), 
the information cost is cN. 

There is a large stream of results, giving conditions under which adaptive in- 
formation is superior (or not superior) compared to nonadaptive information; we 
mention the pioneering paper by Bakhvalov [2], the results on operator equations 
by Gal and MiccheUi [44] and by Traub and Wozniakowski [86] , and the survey [65] . 
For example, it is known that adaptive information does not help (up to a factor of 
2) for linear operator equations and the worst case error with respect to the unit ball 



n 



(36) 
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of a normed space F. If F is only quasi-normcd then the proofs must be modified, 
with a possible change of the constant 2. Nevertheless nonadaptive information is 
almost as good as adaptive information. 

How much information is needed about the right hand side f E F in order that 
we can solve the equation (1) with an error e? This question is answered by the 
minimal radii of information rn{S, F, H) (or the closely related Gelfand numbers). 
These numbers are a good measure for the information complexity of the operator 
equation. In contrast, the output complexity of the problem is measured by the 
nonlinear widths e^°^{S, F, H). These numbers measure the cost of just outputting 
the approximation (with respect to an optimal basis B e Be) - It is quite remarkable 
that, under general conditions, we obtain the same order 

r„(5, F, H) X ci"(5, F, H) x 6-^(5, F, H) x n-"\ 

see Theorem 3 and Theorem 4. 

Now we discuss adaptive numerical schemes for the numerical treatment of el- 
liptic partial differential equations. Usually, these operator equations are solved by 
a Galerkin scheme, i.e., one defines an increasing sequence of finite dimensional ap- 
proximation spaces Ga, := span{?7^ : fi e A;}, where Ga; C Ga;^_i, and projects the 
problem onto these spaces, i.e., 

{AuKi.v) ^ {f,v) for all v E Ga^. 

To compute the actual Galerkin approximation, one has to solve a linear system 

Aa,ca, = fAp Aa, = ((^V' nn))n,n'€Kr = (/, r]^), 1^ e Ai. 

Then the question arises how to choose the approximation spaces in a suitable way, 
since doing that in a somewhat clumsy fashion would yield huge linear systems and 
a very unefficient scheme. One natural way would be to use an updating strategy, 
i.e., one starts with a small set Aq, tries to estimate the (local) error, and only in 
regions where the error is large the index set is refined, i.e., further basis functions 
are added. Such an updating strategy is usually called an adaptive numerical scheme 
and it is characterized by the following facts: the sequence of approximation spaces 
is not a priori fixed but depends on the unknown solution u of the operator equation, 
and the whole scheme should be self-regulating, i.e., it should work without a priori 
information on the solution. In principle, such an adaptive scheme consists of the 
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following three steps: 



solve 



estimate 



refine 



Aa^Ca, - fA( 



\\u - Ma, II =^ 
a posteriori 
error estimator 



add functions 



if necessary. 



Note that the second step is highly nontrivial since the exact solution u is unknown, 
so that clever a posteriori error estimators are needed. These error estimators should 
be local, since we want to refine (i.e. add basis functions) only in regions where the 
local error is large. Then another challenging task is to show that the refinement 
strategy leads to a convergent scheme and to estimate its order of convergence, if 
possible. 

Recent developments indicate the promising potential of adaptive numerical schemes, 
see, e.g., [1, 3, 4, 5, 39, 80, 93] for finite element methods. However, to further ex- 
plain the ideas and to make comparisons as simple as possible, we shall restrict 
ourselves to adaptive schemes based on wavelets. For simplicity, we shall mainly 
discuss the approach in [21]; for more sophisticated versions the reader is referred 
to [13, 14, 15, 22]. The first step clearly must be the development of an a posteriori 
error estimator. Using the fact that A is boundedly invertible and the usual norm 
equivalences, compare with (112), we obtain 

(37) - liAllif- ^ ||.4('u - iiA)||if-= 



Prom (37), we observe that the sum of the residual weights gives rise to an efficient 
and reliable a posteriori error estimator. Each residual weight 5j^x can be interpreted 
as a local error indicator, so that the following natural refinement strategy suggests 



rA\\H-' 




where the residual weights 5j^x can be computed as 
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itself: Add wavelets in regions where the residual weights are large; that is, try 
to catch the bulk of the residual expansion in (37). Indeed, it can be shown that 
this strategy produces a convergent adaptive scheme, in principle. However, we are 
faced with a serious problem: the index set J will not have finite cardinality, so that 
neither the error estimator nor the adaptive refinement strategy can be implemented. 
Nevertheless, there exist implementable variants, see again [13, 21] for details. We 
start with the set 

Jj,x,e- {(j',A')| |(^V'j',A','0j,A)| £-significant} 

and define 

%,a(A, e) := 2-'^ \ (^V'j'.A', i^j,\)uj>,\' \ ■ 

(The expresssion 's-significant' can be made precise by using the locality and the 
cancellation properties of a wavelet basis). By employing the aj^\{h.,e) we obtain 
another error erstimator: 

\\u-u^\\h^ <c - {( a]^ + e\\f\\H-'^+ird\\F -v\\h-s\. 

Here Va denotes the approximation space spanned by the dual wavelets corre- 
sponding to A, see Section 5.3 for details. Now, playing the same game for the 
aj^x{A,e) instead of the 6j^\, we end up with a convergent and implementable adap- 
tive strategy. To this end, the starting index set A has to be determined such that 
inf^gy^ 11/ — < c • eps and £(/, eps, 9) has to be computed. Then, there exists 

a constant k e (0, 1) such that whenever A C J, A C A is chosen so that 

(38) %,a(A,£)M >(1-^) Y1 %.a(^'^) 
\0-,a)ga\a / \0-,A)GJ\A 

either 

(39) ||w — li^^ll < — wa||, k e (0, 1) 
or 

(40) J2 «i.A(A,£)' <eps 

\{j,A)eJ\A / 

which implies that 

(41) \\u — u\\\ < eps • c. 
For the proof and further details, the reader is again refered to [21]. 
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Remark 15. i) In order to avoid unnecessary technical and notational difficul- 
ties, we have not presented the explicit form of the function e{f,eps,9). It 
depends in a complicated, but nevertheless computable way on the final accu- 
racy eps, the control parameter 9 , the -norm of the right-hand side f , 
and on the stability and ellipticity constants of the problem. For details, we 
refer again to [21]. 

a) The norm \\ ■ \\ in (39) and (41) clearly denotes the energy norm \\v\\ {Av, v), 
which is equivalent to the Sobolev norm , see again [50] for details. 

Hi) Eqs. (39), (40) and (41) obviously imply that the adaptive strategy in (38) 
converges. Indeed, the error is reduced by a factor of k at each step until the 
sum of the significant coefficients in (40) is smaller than the final accuracy, 
which by (41) means that the same property holds for the current Galerkin 
approximation. 

iv) Although the sum in the right-hand side of (38) formally still contains unfinitely 
many coefficients, it can be checked that this sum in fact runs over a finite set, 
so that the adaptive strategy is implementable. 

Let us now compare this concept of adaptivity with the notion of adaptive infor- 
mation explained above: 

• From the discussion presented above, we have seen that adaptive wavelet 
schemes are not performed by gaining more and more information from the 
right-hand side / in an adaptive fashion. Instead they use the residual which 
depends on the right-hand side, the operator, and the domain. Moreover, we 
see that the starting index set A is determined by the wavelet expansion of 
the right-hand side. That is, A is given by some kind of best n-term approxi- 
mation of /, which is assumed to be available or to be easily computable. In 
this sense, the adaptive wavelet schemes require nonlinear information about 
the problem. 

• In the wavelet setting, the benchmark for the performance is the approximation 
order of the best n-term approximation of the solution, i.e., the numbers 

(42) sup an{A-'f,B)H. 

\\f\\F<l 

It has been shown quite recently in [13] that a judicious variant of the algo- 
rithm outlined above gives rise to the same order of approximation as best 
n-term approximation, while the number of arithmetic operations that are 
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needed stays proportional to the number of unknowns. Here the authors im- 
pUcitly assume that certain subroutines for fast matrix-vector multiphcations, 
approximations of the right-hand sides and for thresholding are available, and 
that all these routines have to realize a given approximation rate. Moreover, 
it is assumed that the solution u is contained in some Besov space Bp{Lp{Q)), 
and hence F is a suitable subset of A{Bp {Lp{fl))) , i.e., the admissible class of 
right hand sides depends on the operator A. Observe that, for given F and B, 
the numbers eJ^°^(S', F, H) might be much smaller than the numbers in (42) 
since it is, in general, not clear whether a wavelet basis is optimal. 

• The performance of an adaptive scheme is not compared with an arbitrary 
linear scheme. The reason for that is simple, and has already been explained 
earlier. It is indeed true that linear approximation often produces the same 
order as nonlinear (best n-term) approximations, see Theorem 2 and Theorem 
4. However, for nonregular problems, it would be necessary to precompute the 
optimal basis S{gi) in advance, which is mostly too expensive and should be 
avoided in practice, see [24] for further details. One usually compares adaptive 
schemes with uniform methods for then a precomputation is not necessary. 
Therefore the use of an adaptive wavelet scheme is justified if it performs bet- 
ter than any uniform scheme. It is known that the order of approximation of 
uniform schemes is determined by the Sobolev regularity H^iVt) of the object 
we want to approximate whereas the approximation order of best n-term ap- 
proximation depends on the regularity in the specific Besov scale S*(L^(r2)), 
where 

1 _t-s 1 
r "^^2' 

see [20, 29] for details. Therefore adaptive schemes are justified if the Besov 
regularity of the exact solution is higher than its Sobolev regularity. For elliptic 
boundary value problems, there exist now many results in this direction, see, 
e.g., [16, 17, 18, 19, 23]. 

• In approximation theory, an approximation scheme that comes from a sequence 
of linear spaces that are uniformly refined is also called linear approximation 
scheme, which sometimes causes misunderstandings because these schemes are 
only special cases of the linear schemes considered, e.g., in Theorem 4. To avoid 
this confusion, we used the term uniform methods instead of linear methods. 

Remcirk 16. In this paper we study the complexity of solving elliptic partial differen- 
tial equations. We only deal with the deterministic setting. The randomized setting, 
where also the use of random numbers is allowed, is studied by Heinrich [51]. The 
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complexity of solving elliptic PDF in the quantum model of computation (where one 
can use a certain nonclassical randomness) is studied in [52]. 

4 Proofs 

4.1 Properties of Widths 

Proof of Lemma 1. Step 1. Part (i) is proved in [87] for the case where F is normed. 
The general case is similar. 

Step 2. To prove part (ii), we assume that -S'dH/Hir < 1}) contains an (n + 1)- 
dimensional ball B C H oi radius r and that : F — > is continuous. Since 
S~^{B) is an (n + 1) -dimensional bounded symmetric neighborhood of 0, it follows 
from the Borsuk Antipodality Theorem, see [28, paragraph 4], that there exists an 
/ e dS-\B) with NM = ^n(-/) and hence 

for any mapping : ^ G. Observe that = 1- Because of ||S'(/)— /)|| = 
2r and = Sn{—f) we obtain that the maximal error of Sn on {±/} is at least r. 

This proves 

b^(S,F,H)<e'i:-\S,F,H). 

Since we did not use the continuity of also bn{S, F, H) < d"{S, F, H) follows. 
□ 

Proof of Lemma 3. Step 1. Proof of (i). A corresponding assertion with X and 
Y normed linear spaces has been proved in [40]. This proof carries over without 
changes. 

Step 2. Proof of (25). Let B — {hi, /i2, ■ ■ ■ } be a Riesz basis of G with Riesz constants 
A,B > 0. Let this basis B and a corresponding mapping be optimal with respect 
to /, F, G (up to some £ > if necessary). Then the image of B under the mapping 
5" is a Riesz basis of H with Riesz constants A' — A/\\S~^\\ and = S US'!!. Prom 

\\Sf-{SoS^)f\\H<\\S\\ ||/-5„(/)||g 

it follows that 

<:c\is-m(^^F^H)^\\^\\<c(I,F,G). 

Replacing C by C/(||5-i|| \\S\\), the right-hand side in (25) follows. 

Now, let B C if be a Riesz basis with Riesz constants A,B > 0. Let B and a 
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corresponding Sn be optimal with respect to S, F, H (again up to some £ > if 
necessary). From 

II // - {S-' O Sn)f \\g < \\S-'\\ II Sf - Snif) \\h 

it follows that 

en:c\\s-m\s\iiI,F,G)<\\S-'\\el^^^{S,F,H). 
The proof of (24) follows from (27). □ 



Next we turn to the proof of Theorem 1. It is convenient for us to start with a 
simplified situation. For this we assume that K C H is compact. We define 

(43) el^^^{K,H)=M sup a{u,B) 

BeBc ueK 

and 

(44) e'^^\K, H) = inf sup ||(/^„(7V„(ii)) - ul 

where the infimum runs over all continuous mappings : M" H and : K — > 
M". We prove the following result. 

Proposition 1. Let K C H be compact. Then 

(45) eZl,iK,H)<2Cel^^{K,H). 

Proof. Let B G Be be given. Since K is compact, we only need finitely many elements 
of B, in the sense that 

(46) sup \\u — L}sf{u)\\ < e 

ueK 

for 

N 



(47) LN{u) = ^ajhj 

i=i 



Here Ljv is the orthogonal projection onto the space that is generated by /ii, . . . , /i^v- 
The functionals aj are linear and continuous. Moreover, we know that 



N \ N / N \ 

2 



(48) ^(EK-n <iiE"Aii<i? E 



l«il 
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with B/A < C. We may assume that A = 1. For a suitable B G Be we obtain 



(49) sup 
ueK 



^Ckhi^ - Lm{u) 



<e^:UK,H)+e. 



k=l 

Let /3 > 0. We define a modification of L^v by 

TV 

(50) L^(«) = 5^a*/., 

where = % if |aj| > 2/3 and a* = if |aj| < To make the continuous we 
define 

a* = 2sgn(aj) ■ {\aj\ — (3) 

for \aj\ G (/?, 2(3). We prove certain statements about Ly and denote the best n-term 
approximation of u by «„. 

Assume that for u E K, there are m > n of the aj, see (47), such that \aj\ > {3. 
Then we obtain 

\\un-LN{u)\\ > {m-ny^^p 

and with (49) we obtain 

(51) m-n<^(e-S(K,if)+s)2. 

Now we consider the sum Oj for u E K. We distinguish between those j that 

are used for Un (there are only n of those j) and the other indices and obtain 

|a^P</3 

Now we are ready to estimate \\L*j^{u)—Ljq{u)\\ forti G K. Observe that |a* — aj| < /? 
for any j. We obtain 

\\LUu) - L^{u) II < B{m(3^ + n(3^ + (e°^(X, //) + eff/^ 

Using the estimate (51) for m, we obtain 

||L^(^) - L^{u) II < B{2nP' + 2{e-^^{K, H) + effl\ 

Now we define /3 by 

n/3^ = (e;i^(i^,i/) + £)2 

and obtain the final error estimate (where we replace, for general A, the number B 
by B/A) 

\\LUu)-Lr,{u)\\<'^-^{e-:^{K,H) + e). 
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In addition we obtain 

m <2n 

and therefore L%- yields a continuous 2n-term approximation of u & K with error 
at most 

sup ||L^(«) -u\\<^ {e^:^{K, H) + e) + e. 

The mapping is continuous and the image is a complex of dimension 2n, see, e.g., 
[31]. Hence we have an upper bound for the so-called Aleksandrov widths, see [31] 
and [79]. By the famous theorem of Nobeling, any such mapping can be factorized 
as = (p4n+i o N^n+i where A^4„+i : K and </?4n+i : H are 

continuous. Hence the result is proved. □ 

Proof of Theorem 1. The unit ball of F is a compact subset of G by assumption. 
Prom Proposition 1, we derive that 

eZliiI,F,G)<2Cel^^^{I,F,G). 

Next we apply Lemma 3(ii), and obtain 

eT{S,F,H)<\\S\\erV,F,G), 

as well as 

(^l°ci^^F,G) < ll^""^!! e°°g/(||5-i||||5||)(5',F,if) . 
Combining these inequalities, we are done. □ 



4.2 Widths of Embeddings of Weighted Sequence Spaces 



Having the wavelet characterization of Besov spaces in mind, cf. Subsections 5.3 
and 5.4, we introduce the following scale of sequence spaces. 

Definition 3. Let < p,q < oo and let s e R. Let V := (Vj)j be a sequence of 
subsets of finite cardinality of the set {1,2,...,2'^ — IjxZ'^. We suppose that there 
exist < Ci < C2 and J e N such that the cardinality \ V, | of V satisfies 



(52) 



Ci < 2-^'^ \Vj\< C2 for all j>J. 



Then bpg^V), where < q < 00, denotes the collection of all sequences a — (aj,A)j,A 
of complex numbers such that 

n 



(53) 



j=0 



j^s + d{l-l))q/ 



P 



^ Aev, 



«i,A 



q/p 



< 00 . 



/ 
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For q = oo, we use the usual modification 

j{s + d{ )) / \ 

(54) ll«lk^^:= sup 2 ^ P [Yl <oo. 

^=''''- \xev, J 

If there is no danger of confusion we shall write bp ^ instead ofh^^^iV). 

Remark 17. In what follows, we shall let ej^x denote the elements of the canonical 
orthonormal basis 0/62,2- -^^^ cr e M. /t is obvious that the linear mapping L„ defined 
by 

L„ Cj- A := 2"'"^ ej,A for all j, A , 

extends to an isomorphism from bp ^ onto 6^^*^ (simultaneously for all s,p,q) with 
\\La\\ = 1- 

In the framework of these sequence spaces it is very easy to prove embedding theo- 
rems, cf. [57]. 

Lemma 4. Let < Po,Pi, Qo, ?i < 00, s G M, and t >0. 

(i) The embedding 

Po,qo \ ^ ) pi,qi\ ) 

exists (as a set theoretic inclusion) if and only if it is continuous if and only if either 

(55) t>d(—-—) 

\Po PiJ + 

or 

t — d \ I and Qo ^ Qi ■ 

(ii) The embedding 

is compact if and only if (55) holds. 

The main result of this subsection consists in the following: 

Theorem 6. Let < p,po,Pi < 00, < q, qo, qi < 00, and s e R. 
(i) Suppose that 

(66) *><r^). 
holds. Then, for any C > 1, we have 

e^I, by) X n*/^ . 
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(ii) Suppose that (56) holds. Then we have 



n-^l'^ if 2 < p < oo, 



(iii) Suppose that (55) holds. Then we have 

cont/j is+t IS \ -t/d 

Remcirk 18. In part (i) there is an interesting limiting case. Suppose < p < 2 
and t — d{l/p — 1/2). Then the embedding 6^^* ^ 6| 2 exists, of. Lemma 4, and 

[n*/'' (7„(a, B)6|^ - j < 00 if and only if a e bp-^p . 

In view of Lemma 4{ii), this shows that lim„_^oo e°°f^(5', F, i7) = does not imply 
compactness of S. 

The proof of Theorem 6 requires some preparations. It will be given in Subsections 
4.2.2-4.2.4. 

4.2.1 The Bernstein Widths of the Identity Operator 

We concentrate on the estimate from below. For later use we treat a more general 
situation. 

Lemma 5. Let < PQ,Pi,qQ,qi < 00, s G M and t > such that (55) holds. Then 
there exists a positive constant c such that 

(57) b{I¥+' b^ ^yd""'"' if 0<po<p,<oo, 

^ ' ^"^^'%,.o'V9J-^| ^-t/d+i/po-iM if o<p^<po<oc. 

holds for all n. 

Proof. The Bernstein numbers are monotonic in n. So it will be enough to prove 
the assertion for sufficiently large n. Consequently, we may assume that there is a 
natural number N > J, as well as positive constants Ci and C2, such that 

ci 2^^^ < n < C2 2^^^ . 



28 



Step 1. Let < po ^ Pi- Using Holder's inequality we find 

l/po 



< C2 2^*|| J] bxeN,x\b; 

AeVjv 



AeVjv AeVj 

i/pi 

i&Arj 

" AeVjv 



AgMat 

where C2 corresponds to (52). Consequently, the unit ball in ^p^J^^y contains the n- 
dimensional ball (spanned by the vectors cn^x, A G Vat) with radius Cg^n"*/'^. This 
proves 

for some positive constant c independent of n. 

Step 2. If po > Pii then Holder's inequality (used in the second line of the estimate 
in Step 1) will be replaced by the monotonicity of the ^^-norms and we obtain 



AeVjv AeVjv 

< 2N{s+t+d/2-d/po) ^ ^ I&aT'^^''^' 
AgVat 

< 052-'^^*''"'^/^^""''/*'°) 



E ^AeiV,A|&^ 

AeVjv 



Pl>l?l 



This time the unit ball in fep^^p contains the n-dimensional ball with radius 

^-1 2-N{t+d/pi-d/po) 

This proves our claims. □ 

Remark 19. In the one- dimensional periodic situation, estimates of the Bernstein 
numbers from above are also known, due to Tsarkov and Maiorov, cf. [85, Thm. 12, 
p. 194]. Let 1 < p < 00 and s > 0. By we denote the collection of all 27r- 
periodic functions f with Weyl derivative of order s belonging to Lp{T) and satisfying 
/-Ttt fix)dx = 0. Then 

if 1 <Pq <pi <oo or 

1 < Pi < Po < 2 and i > , 
^-t+i/po-i/pi 2 < pi < po < 00 and t > 1/po , 

^ ^-t+i/po-i/2 If 1 < Pi < 2 < po < 00 and t > 1/po • 

This should be compared with Lemma 5 for s — and d — 1. 
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4.2.2 Best m-Term Approximation in the Framework of Sequence Spaces 

We prepare the proof of part (i) of Theorem 6. Also, here we treat a more general 
situation. Let B denote the canonical basis (ej,A)j,A in ^2,2 (^)- Then our aim in 
this subsection consists in a characterization of the behaviour of the best m-term 
approximation of a given element a e ^p^^g with respect to B. 
The main result of this subsection reads as follows: 

Theorem 7. Let < Po,Pi, Qo, Qi < C)0; s e R and t > such that (55) holds. Then 
we have 



(58) 



sup{(7„(a,B)6.^ ^^ : \\a\\,s+t^^ < l} 



n 



-t/d 



We start with some preparations. Let U denote the unit ball in Then 



and sup 2^(^+W2-i/Po))(^^ |^._^|Poj 
j=o,i,- xeVi 



The following lemma will be of some use: 
Lemma 6. Let < po < pi and suppose that 



(59) 



t > d 



1 1 



^Po Pi' 

For all a & U and all n> 1 there exists a natural number K :— K{a, n) such that 

K 



holds. 



J2 J2 ^j'^^i'^ 



pi,<ii 



Proof. We define 



Then one has 



Tj := Yl ^^^^ ^J'^ ' i = 0' 1 ■ ■ ■ ■ 



Aev,- 



K 



3=0 AeVj j>K 

Since of < po < pi < 00, the monotonicity of the £g- norms and a G U lead to 



T. I/)* II < 03{s+d/2-d/pi) ( Sr~^ 

J- 3 |Opi,gill - ^ \ 2^ 

Aev. 



«7,A 



1/po 
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Let u = min(l,pi, gi). Consequently, using (59) and chosing K large enough, we 
find 

j>K j>K j>K 

< (J^ 2-Ku{t+d{l/po-l/pi)) 7j,-W<i 

This proves the claim. □ 

The basic step in deriving an upper estimate of (7„(a, B) is the following proposition. 
Again U denotes the unit ball in 6^^^^. 

Proposition 2. Let < po < pi < oo. Let a E U , n E N, and let K = K{a,n) be 
as in Lemma 6. Then there exists an approximation 

K 

(60) Sna := ^^a*,xej,x 

of a, which satisfies the following: 

i) The coefficients a*^^ depend continuously on a. 

a) The number of nonvanishing entries is bounded by c- n. 

Hi) \\ a — Sna\bp^g^\\ < cn^*/"^ , n = l,2, .... 
Here c can be chosen independent of a and n. 

Proof. Observe that it will be enough to prove the claim for natural numbers n — 
2^'^, where e N. We define 

^ ^ t - d (1/po - iM) 

2(l/po-lM) ' 

if 1 < i < 



(61) £ 



3 



(62) A* := {agV,-: \aj,^\r^ > Sj} , j = 0, 1, . . . . 
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Then, if j > N, 



Ka 



PO 



PO 



IPO 
oo ll 



(63) I A* I = - J2 

AeA] AeA] 

< ^ 2-?''(i/2-i/po)po2J*po2-0'--^)<52J'*po I ^ |po 

< n2-(^'-^)^ 

Now a typical method to approximate a would be to choose \ = Cij,\ , J G A* and 
zero otherwise. However, this selection does not depend continuously on a. Therefore 
we use the following variant. Let gj denote the following piecewise linear and odd 
function, 

10 if < x < , 

X a x>2- 2-^-'ej , 

hnear if x e i2-^'ej, 2 ■ 2-^'ej) . 

Then we set 

(65) a* A — 5j(%,A) 

and consider the associated approximation (60). Let us prove that Sn will do the 
job. 

Step 1. We shall prove (i). Observe 

K N K 

3=0 3=0 j=N+l 

cf. (63). The constant C2 is independent of a, /T, and n. This proves (i) and (ii). 
Step 2. Proof of (iii). We have 

K K 

3=0 xeVj j=o 

where 

AeVj 
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From Lemma 6, we can conclude that || Si ^^Jl < n"*/"^ for K large enough. 
Therefore it remains to estimate || T* |6p^_gJ|. Since \gj{x) — x\ < \x\ and a*^^ — 
Gj X for |a, A| > 2£j2~^^, we obtain 



T* lb" I 



< 


l"j,A| 


< 


i%,Ai^°i%,Ar-^° 


< 


|a,,,r°(2£,)^^-^°2-- 


norm of T* as follows: 




\ 1/pl 


E 


i«i,A-4Ar' ) 







/ \ J^/Pl 



1/pl 



^ „ i-Po/pir,-j{t+d/2-dpi/{2po))po/pi II „ lUS+t ||po/pi 

^ C2£:j- Z ll"l%,ooll 

< ^2 ^l-po/pi2-j(t+<i/2-dpi/(2po))po/pi 



where again C2 does not depend on a and n. For j > N we continue by employing 



the concrete value of ej and obtain 



T*\b^ II < C2 ^n"^/^°2"-'''^^^/^"'^/^°^2"^'*2^-''"^)''/^°y ^°^^'2"^'(*+^^^^ 



By construction T* = ii j < N, by definition, we have 



t-di^-^]>5(^- ' 



-Po Pi'' Vpo Pi 
Hence, with u — min(l,pi, qi), we have 

j=N+l 

< ^^i/pi-i/po 2-JV<5(i/po-i/pi) 2"-'^"(*~'^^^/^°~^/*'i)~'^/^°+''/^i) 

= C3 ^^Vpi-Vpo^ 2"^"^*"'^(^/^°"^/^i)) , 
with C3 independent of K, n and a. Recalling that 2^'^ — n, we end up with 

||S2|6;,,,J| < csn-'/'. 
This finishes the proof of Proposition 2. □ 
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For completeness and better reference we formulate the counterpart of Proposition 2 
in the case po > pi- 

Proposition 3. Let < pi < po < oo. Let a e U (the unit ball in b^'^^^) and 
2Nd <_ n < Then the approximation 

N 

(66) Sna ^o,^^3,x 

of a satisfies the following: 

i) The coefficients Uj^x depend continuously on a. 

a) The number of nonvanishing entries is bounded by c- n. 

Hi) II a - 'S'na gjl < cn~^/'^ , n=l,2, .... 
Here, c can be chosen independent of a and n. 

Proof. The proof is elementary. □ 

Proof of Theorem 7. The estimate from above follows from Propositions 2 and 
3, as well as the continuous embedding 6^+*^^ ^ Ko!oo- For the estimate from below, 
it will be enough to consider n — 2^*^, where N > J and N e N. Let K be the 
smallest natural number such that Ci 2^'^ > 2 (here Ci is the same constant as in 
(52)). Then 

Ci 2(^+^)<^ 1 
n < < - \Vn+k\ ■ 

Let r C Vjv+K with |r| — n. We define 

a=|V^+Kri/^°2-(^+^)(^+*+''(V2-i/po)) J2 eN+K,x. 

Consequently || a ||js+t^ = 1 for any qq. Furthermore, we find 



AeVjv+K\r 

= \Vn+k\~^^^° 2-(^+-^)(*+''(Vpi-Vpo))|v^^^ \ r|i/pi 
~ 2VpiC2/^° 



6* 

PI. 91 



21/pi c-Vpo 
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(also 6*2 has the same meaning as in (52)). It is clear that an optimal F with |r| = n 
has to be a subset of Vn+k- This completes the proof of the estimate from below. 
□ 



Proof of Theorem 6(i). The estimate from above is covered by Theorem 7; the 
estimate from below follows from Theorem 1 and Theorem 6(iii). □ 

Remcirk 20. Stepanets [78] has investigated the quantities 
for the specific case 



"PI, 91 



s — d ( y with P\ — qi- 

In this special case, the associated nonlinear withs related to quite general smoothness 
spaces are studied. He proved explicit formulas from which the asymptotic behavior 
could be derived. 

4.2.3 The Manifold Widths of the Identity 

Proof of Theorem 6(iii). Without loss of generality we may choose s = 0, cf. 
Lemma 3(ii) and Remark 17. 

Step 1. The estimate from above. In the case pi = gi = 2 we may use Propositions 
1, 2 and 3 to get the desired inequality. However, for the general case we have to 
modify the argument. We follow the arguments used in [31]. Let U denote the unit 
ball in As explained there Propositions 2 and 3 guarantee that 

where a" denotes the Alexandroff-co- width, cf. [31] for details. But 



^0 ^ 



cf. [31] and [40]. Let us mention that in the literature quoted the target space was 
always a normed linear space. But the arguments carry over to quasi-normed linear 
spaces. 

Step 2. The estimate from below. Lemmas 1 and 5 yield the lower estimate in case 
< Po < Pi < oo. 
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Now, let Pi < Po ^ oo. Let e > 0. Wc consider the diagram 

hO v-d{l/pi-l/po)-e 
pi,qi P0,oo 

h \ / h 

Po,?o' 

where /i, I2 and I2, are identity operators. Then (23) yields 
which implies that 

for some positive ci and C2 (independent of n), see Lemmata 5, 1, and Step 1. □ 



Remark 21. It is clear from the proof given above that the knowledge of the Bern- 
stein widths is not enough to establish the estimate from below of 6^°"^*. Here the 
multiplicativity of the numbers ejf"*, cf. (23), is crucial. This seems to be over- 
looked in [31]. 

4.2.4 The Approximation Numbers of the Identity 

Proof of Theorem 6(ii). Step 1. Let 2 < p < 00. From Proposition 3 wc obtain 
the estimate from above with Sn given by (66). The estimate from below is covered 
by (58). 

Step 2. Let < p < 2. Without loss of generality wc assume s = 0. Let Sn be defined 
by (66). The estimate from above is easily derived by using the monotonicity of the 
4-norms and t + rf(l/2 - 1/p) > 0: 

2/p 



a-S^a\bl,f < E (E M')' 
j=N+i xeVj 

00 y 1 / \ ■ 

< ( y 2-2jmv2-i/p))U sup 2^(t+d(i/2-i/p))f y /n 



j=N+l 

^ ^2-2N{t+d{l/2-l/p))nnt ||2 



p,oo I 
2 



< c(n-'/''-'/'+'/P\\a\blJ\^ , 



where c does not depend on n and a. For the estimate from below, we use the obvious 
fact that the optimal approximation of an element in a Hilbert space is given by the 
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partial sum with respect to an orthonormal basis. Hence, if Sn is a linear operator 
of rank at most n then 

II a - 5'na|&o,o|| > || a - 5'na|6o,o|| , 
where Sn is defined by (66). We put 

N+l 
j=0 



^3 ^ ^ J 

N+l 



a\blg\\ = ^ ^ 2^(*+'^(^/2-^/*'))«^ > 2^(*+<V2-i/p)) 



where Xj e Vj can be chosen arbitrarily. Then 

V? 

j=0 

for some positive c independent of n and 

II a — <S'„a|62,2ll — 1 ■ 

This implies 

II 7" _ Q l/i* II > \ 

II J- l^p^gll _ 2N{t+d{l/2-l/p)) ' 

which finishes the proof of the lower bound. □ 



Remark 22. Notice that in any case, an order-optimal approximation is given by 
an appropriate partial sum, see (66). 

4.2.5 The Gelfand Widths of the Identity 

What we will do here relics on a result of Gluskin [45, 46] about the Gelfand widths 
of the embedding which we now recall. Let = 1. For all natural 

numbers m and n, where n < m, it holds that 
(67) 

{(m-n+l)2p if 2<p<oo, 

1 if l<p<2 and l<n<m2/f', 

^i/p' -f 1 < p < 2 and m^/^' <n <m. 

A simple monotonicity argument leads to the following supplement to p = 1. There 
exists a constant c, independent of m and n, such that 

(68) d"(/,£™,£^) < 



37 



if < J9 < 1 and 1 < n < m. 

The Gelfand widths are examples of so-called s- numbers, cf. [72, 73] and [10]. Fol- 
lowing Pietsch [72, 2.2.4, p. 80] we associate with the sequence of Gelfand widths 
the following operator ideals. Let F and E be quasi-Banach spaces and denote by 
C{F, E) the class of all linear continuous operators T : F ^ E. Then, for < p < oo, 
we put 

42o := {t e C{F, E) : sup n^'^' d^T) < ooj . 
Equipped with the quasi-norm 

Xr{T) :=sup v}''(r{T), 

the set df)yo becomes a quasi-Banach space. For such quasi-Banach spaces there 
always exist a real number q & (0,1] and an equivalent quasi-norm, here denoted by 
II • |>Cr^oo||, such that 

(69) II Ti + T, \C^^lr < II Ti \4%r + II T2 14111^ 
holds for all Ti, T2 e jC.i%>. 

To shorten notation we shall use the abbreviation /"^ for the identity / : ^ . 
It is not complicated to check that (67), (68) imply the following estimates for 
||/-2|41||,cf. [58]. 

Lemma 7. Let < r < 00. 

(i) Let2<p<oo. Then 

(70) 11^2 14111^ 
holds. 

(ii) Letl<p<2. Then 

(7^\ II ir(^) II ^ / if 0<r<2, 

['^) ll^p,2l^.,ool|-^ I if 2<r<oo, 

holds. 

(iii) Let < p < 1. Then there exists a constant c such that 



(-) 11 C 14111. cf-"^ llll^ 

1 ij 2 < r < 00 , 



holds for all m e N. 
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To prove the estimates of the Gelfand numbers from above, it turns out to be useful 
to spUt the identity / into two parts id^,id^ and to treat them independently. In 
fact, we shall investigate || id* |£r^|oo||, i = 1,2, where ri and r2 are chosen in different 
ways. For basic properties of the Gelfand numbers we refer to Remark 7 and [10, 
2.3]. 

Theorem 8. Let < q < oo. 

(i) Let 1 < p < 2 and suppose that t > d/2. Then 

(ii) Let 2 < p < oo and suppose that i > 0. Then 

(iii) Let < p < 1 and suppose that 

(73) *>K^-\ 
Then there exist two constants Ci and C2 such that 

ci n-*/'' < b;+\ 6^_2) < 02^-*/'^-^+^/^' . 

Proof. Without loss of generality we may assume s = 0. To see this consider the 
diagram 

us+t ^1 - us 



Ut ^ uO 

^p,q ^ "2,2 ? 

where Lg denotes the isomorphism introduced in Remark 17. The multiplicativity 
of the Gelfand numbers implies that 

d"(7i, 6^2,2) < II II II II <r{h, 6° 2) , 

compare with Remark 7. Changing L_s into Ls and vice versa in the diagram above 
we end up with 

C?"(/i,&p:/'^2,2)=C?"(/2,6;,„&^,2)- 

Step 1. Estimate from above. We concentrate on natural numbers n = 2^'^ for 
N & N (the remaining can be treated by the monotonicity of the d"). Let idj denote 
the projection given by 



m,\ I otherwise . 
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We split the identity / into a sum / = id^ + id^ depending on N, where 

TV oo 

id^ := ^^idj and id^ := id^ . 

j=0 j=N+l 

Later on we shall apply the following observation. Consider the diagram 

p 

where P and Q are defined as follows. Let a = (a^^A)^,A- Then 

(P(a))A := %,A ■ 

For h = (6a) A we define 



Obviously, 



%,A if j = , 

otherwise . 



II P II = 2-^-(t+<i(i/2-i/p)) and II g II = 1 . 
Then property (27) for the Gelfand numbers yields 

^"K,&;+*,6^,2) < II ^11 IIQIM"(/r2'') 

(74) < 2-^(*+''(V2-i/p))^n(jlJil)_ 

Substep 1.1. The estimate of (i"(id\ 6* ^, 622)' ^ — First we suppose 2 < p < 00. 
Thanks to (69), (70), and (74) we find 

N 

l|idM41ll^ < Ellid^l^tll' 

j=0 

N 

< 2"-''^*+'^(^/^"^/^'))^ II z'^^' ivC^"") 11^ 

— / ^ 1 1 p,2 I 7",00 1 1 

j=Q 

N 
j=0 

(75) < C2 2^^''/"-*)^ 

if (i > t r. Choosing r small enough, we derive from the definition of Cr,oo that 

(76) d"(id^) = d2"'(id^) < C3 2-^* = C3 n-*/'^ . 
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Now we consider the case 1 < p < 2. As above, but using (71) instead of (70), we 
find 

I|id^l4;ill<c2 2^w^-*) 

a l/r > t/d and 1/r > 2. Clioosing r small enough, we obtain 

(77) c^2-"(idi) <C4 2-^*. 

Finally, we investigate the case < p < 1. As above, we obtain 

(78) d'"\id') < C5 2-^(*+'^-'^/^) = C5 71-*/'^-^+^^ . 
Substep 1.2. The estimate of rf"(id^, fe*^, 62,2)' where n = 2^^. 

Again we split our considerations into the three cases p > 2 and 1 < p < 2 and 
< p < 1. First, let 2 < p < 00. Using (69), (70), and (74), we find that 



id^ l^^'^) F < 'S^ II id - IC^" 



c) \\S 
ooll 



j=N+l 



— / ^ 1 1 j?,2 I r,oo 1 1 

j=N+l 
00 

< Ci ^ 2-^(*+'^W^-yP))^ 2jdii/r-l/p+l/2)g 

j=N+l 

(79) < C2 2^('^/''-*)^ 

ii tr > d. Choosing r large enough (t > by assumption) , we derive 

(80) ci2"'(id2) <C3 2-^*. 
Now we consider 1 < p < 2. Similarly 

l|id^l41ll<C3 2^(^/^-*) if l< 

2 r d 

Since t > d/2, such a choice is always possible. Consequently, 

(81) d^"'(id^) <C4 2-^*. 
Finally, let < p < 1. Then 

(82) d^"'(id^)<c5 2-^(*+'^-'^/^) if i + i-l>i>i. 

d p r 2 

Such a choice is always possible if (73) holds. 

Substep 1.3. The additivity of the Gelfand widths yields 

d2"(id) < d"(id^) + d'*(id'). 
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In view of this inequahty, the estimate from above of the Gelfand widths follows 
from (76)-(82). 

Step 2. Estimate from below. Since 6„ < erf", cf. Lemma l(i), we may use Lemma 
5 here to derive the lower bound in the case < p < 2. For p > 2, we shall use 
a different argument. Again we restrict ourselves to a subsequence of the natural 
numbers n, where 



Consider the diagram 



2 ~ 2 



JVivl h p\Vn\ 



p 



where Ii and I2 denote identities and this time P and Q are defined as follows. Let 

b = {bx)xe\7j^. Then 



For a — (aj,A)j,A we define 



if j^N, 
otherwise . 



{Q{a))x ■= ajv,A , A e Vjv . 

Obviously, 

IIPII^2^(t+d(i/2-i/p)) ||g|| = i. 

Then property (27) for the Gelfand numbers yields that 

d"(7i,£l7-l,4^-l) < ||P|| IIQII d-{l2,bl^iW),bl,{W)) 
which, in view of Gluskin's estimates (67), implies that 

for some positive c (independent of N) . This completes the estimate from below. □ 

Remark 23. The use of operator ideals in such a connection and the associated 
splitting technique applied in Step 1 has some history, cf. [9, 58, 56]. Closest to us 
is [56], where these methods have been used in connection with entropy numbers. 



4.3 Widths of Embeddings of Besov Spaces 

Here we do not formulate a general result, since the restrictions on the domains are 
different for different widths. 
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4.3.1 The Manifold Widths of the Identity 

The main result of this subsection consists in the following non-discrete counterpart 
of Theorem 6. 

Theorem 9. Let Q be a bounded Lipschitz domain. Let < po,pi < oo, < qo, qi < 
oo, and s G M. Suppose that (55) holds. Then we have 

(83) er (7, <*(L,„(Q)), Bl^{L,,{m x n"*/'^ . 

Remark 24. Theorem 9 has several forerunners. We would like to m,ention De- 
Vore, Howard, and Micchelli [30], DeVore, Kyriazis, Leviatan, and Tikhomirov [31], 
and Dung and Thanh [40]. In these papers, the authors consider the quantities 
e^°"*(/, S*jj(Lp(j(0)), Lp^(O)). Note that from the continuous embeddings 

5°(Lp(Q)) ^ L,{Q) ^ B'^{L,{Q)) , 1 < p < oo , 

we obtain as a direct consequence of Theorem 9 

(84) eT{I:Bl{L,,{n)),L,,{n)) >:n-'/\ 

as long as 1 < pi < oo and t > (1/po — 1/Pi)+- So, Theorem 9 covers the results 
obtained before. However, let us mention that we used the ideas from [31] for our 
estimate from above and the ideas from [40] to derive the estimate from below (here 
on the level of sequence spaces). 

Proof of Theorem 9. Let S denote a universal bounded linear extension operator 
corresponding to Q, see Proposition 6 in Subsection 5.5. Let diamfi be the diameter 
of Q and let be a point in M'^ such that 

C {y : — y| < diamfi} . 

Without loss of generality, we assume that 

supp£fc{y: |a;° — 7/| < 2diamQ} . 

Let V be defined as in (99) and (100) (with Q replaced by the ball with radius 
2diamr^ and center x°). Let R denote the restriction operator with respect to Q. 
Let T denote the continuous linear operator that associates to / its wavelet series; 
T~^ is the inverse operator. Here we assume that we can characterize the Besov 
spaces S*+*jj(R'^), as well as Sp^^g^(R'^), in the sense of Proposition 5 in Subsection 
5.3. Then we consider the diagram 
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T 



Po,qo\ ^ ' 



(85) 



h i 



Observe that Ii = RoT ^ o I20T o E. From (85) and (27) for e*^""^*, we derive that 
eT\h.Bl^\L,,{n)lBl{L,,m < \\E\\ \\T\\ \\T-'\\ er\hX!J'^)^K.,J^)) ■ 
For the converse inequahty, we choose V* = such that 

suppV'i.ACQ, AeV*, J = -1,0, 1,... , 



and infj 2 ■''^ |V*| > 0. Then we consider the diagram 



(86) 



P0,90 

T-1 



Pi ,11 



(V*) 



and conclude that 

er\hX!J^*)^Ki,J^*)) < Wn \\T-'\\ er\luB^^:\LMlBl{L,M))- 
Now Theorem 6 yields the desired result. 



□ 



4.3.2 The Widths of Best m-Term Approximation of the Identity 

Let f2 be a bounded Lipschitz domain in M°'. We assume that for any fixed triple 
{t,p,q) of parameters the spaces B^'^*[Lp{Q)) and -B|(L2(f2)) allow a discretization 
by one common wavelet system B*. More exactly, we assume that (107)-(112) are 
satisfied simultaneously for both spaces, cf. Appendix 5.10. From this, it follows that 
B* e Be* for some 1 < C* < 00. 

Theorem 10. Let fl be as above. Let < p < 00, < q < 00, s E and 

\p 2) + 

holds. Then, for any C > C* we have 
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Remark 25. i) Periodic versions on the d- dimensional torus T'^ may be found in 
Temlyakov [82, 83] with B2{L 2(^1)) replaced hy Lp^{T'^) andpi,p,q > 1. Furthermore, 
more general classes of functions are investigated there (anisotropic Besov spaces, 
functions of dominating mixed smoothness). Finally, let us mention that estimates 
from below for the quantities 

inf sup an{u,B)L^^Td), 

o&U ||,,|| , <1 

where O is the set of all orthonormal bases, have been given by Kashin {pi = qi = 
00, d = 1) and Temlyakov [82, 83] (general anisotropic case). Instead of the manifold 
widths these authors use entropy numbers. 

a) We stress that, in this paper, we study the approximation in some Hilbertian 
smoothness space i?2(-^2(^)) while most known results from the literature concern 
approximation in an Lp{Q)- space. 

Remark 26. We also recall the following limiting case. Let < p < 2 and t = 
d{l/p — 1/2). Then the embedding Bp'^^{Lp{fl)) i?|(L2(0)) is continuous but not 
compact, cf. Proposition 7. Here we have 

i/p 




n 



an{u,B*)B^(L2(n) - < 00 if and only if u e .8*+ {Lp{^})) . 



A proof can be found in [20, Prop. 1], but the argument there is mainly based on 
DeVore and Popov [34], see also [32]. 

Proof of Theorem 10. Let B* be a wavelet basis as in Appendix 5.10. Let B 
denote the canonical orthonormal basis of &2 2(^)- We equip the Besov space with 
the equivalent quasi-norm (112). Observe, 

where c is one of the constants in (111). By means of Theorem 6 and Remark 2(iii), 
this implies the estimate from above. The estimate from below follows by combining 
Theorem 1 and Theorem 9. □ 



The simple arguments used in the proof of Theorem 10 allow us to carry over 
Remark 26 to the sequence space level, see Remark 18, and Theorem 7 to the level 
of function spaces. 

Theorem 11. Let fl and B* be as above. Let < Po,Pi, qo, qi < 00, s e M and t > 
such that (55) holds. Then we have 

sup {<7„(k, S*)B|^(L,,(n)) : II u\B^+\L,,{m\ < l} n-'/'' . 
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Remark 27. i) For earlier results in this direction we refer to Kashin [54], Os- 
wald [68], Donoho [38] and DeVore, Petrova and Temlyakov 



ii) Not all orthonormal systems are of the same quality, see Donoho [38]. Let us 
mention the following result of De Vore and Temlyakov [36] . Let denote the 
trigonometric system in W^. By S|(Lp(T'^)) we mean the periodic Besov spaces 
defined on the d-dimensional torus T'^. Then we put 

{c^(l/Po-l/Pi)+ if < po < Pi < 2 or 1 < pi < po < oo, 

d max (l/po, 1/2) otherwise. 
If ^ ^ Pi ^ oo? < po, qo < oo, and t > t{po,pi), then 



sup 



n if po > max(pi,2) , 

^-t/d+i/po-i/2 < max(pi, 2) = 2 , 

^-t/d+i/po-i/pi < niax(pi, 2) = pi . 

4.3.3 The Approximation Numbers of the Identity 

Theorem 12. Let ft be a bounded Lipschitz domain. Let < p < oo, < g < oo, 
and s e R. Suppose that 

t> di--- 



holds. Then we have 



p 2 



n"*/'' if 2 < p < oo , 



er (/, 5r*(^.(^))> B^^iHm - ^ ^-./.,Vp-i/2 < p < 2 , 

Proof. The statement is a consequence of Theorem 6(ii), Proposition 6, (101) and 
(102). □ 

Remcirk 28. (i) The proof is constructive. An order-optimal linear approxima- 
tion is obtained by taking an appropriate partial sum of the wavelet series of 
£f, where £ is the linear universal extension operator from Proposition 6, cf. 
Remark 22 for the discrete case. 

(ii) This result is well-known. It can be derived from [91] and [43, 3.3.2]. There 
and in [7] information can also be found about what is known for the general 
situation, i.e., in which S|(L2(J1)) is replaced by B^^{Lp^{D,)) . However, let 
us mention that there are many references which had dealt with this problem 
before; we refer to [81, Thm. 1.4.2] and [85, Thm. 9, p. 193] and the comments 
given there. 
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4.3.4 The Gelfand Widths of the Identity 

Theorem 13. Let dW' be a bounded Lipschitz domain and let < q < oo. 

(i) Let 1 < p < 2 and suppose that t > d/2. Then 

d\I, B:;\L,{SI)), B^iL^iSl))) X n-"" . 

(ii) Let 2 < p < oo and suppose that t > 0. Then 

d^(I, *(L^(Q)), S|(L2(Q))) X n-'/' . 

(iii) Let < p < 1 and suppose that 

Then there exists two constants Ci and C2 such that 

ci n-*/'^ < B^+\L,{n)), BliL^m) < c^n"''-^^^!^ . 
Proof. Consider the diagram 

where T and are defined as in the proof of Theorem 9. Since Ji = T'^ o I20T, 
it is enough to combine property (27) for the Gehand numbers and Theorem 8 to 
derive tlie estimates from above. For the estimates from below, one uses the diagram 

where V* is defined as in proof of Theorem 9. This completes the proof. □ 

Remcirk 29. Partial results concerning Gelfand numbers of embedding operators 
may be found in the monographs Pinkus [73, Chapt. VII, Thm. 1.1], Tikhomirov [85, 
Thm. 39, p. 206], and Triebel [88, 4.10.2]. Let T be a compact operator in C{F,E), 
where F,E are arbitrary Banach spaces and let dn{T, F, E) denote the Kolmogorov 
numbers. Then 

d''{T')^dn{T), neN, 

holds, cf. [10, Prop. 2.5.6] or [71]. For Kolmogorov numbers the asymptotic be- 
haviour is also known in certain situations, cf. [73, Chapt. VII, Thm. 1.1], [85, 
Thm. 10, p. 193], [88, 4.10.2], and [81]. 
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4.4 Proofs of Theorems 2, 4, and 5 

4.4.1 Proof of Theorem 2 

For s > we have H-'{Q) = B^'{L2{Vl)). Hence, Theorem 12 yields 

Since S : H^'^{Q) HqIQ) is an isomorphism, we obtain the desired result from 
property (27) for the approximation numbers. 

4.4.2 Proof of Theorem 4 

Since of H~'{n) = S2~'(L2(J1)), Theorem 10 yields that 

Since 5* : H~^[fl) Hq{Q) is an isomorphism. Lemma 3(ii) implies the desired 
result. 

4.4.3 Proof of Theorem 5 

All what we need from the wavelet basis is the following estimate for the best n-term 
approximation in the i7^-norm: 

(87) \\u-SM)\\HHn)<c\\u\Bt^\L,{m\n-'/\ where ^ = ^ + ^, 

sec, e.g., [20] (however we could instead use Theorem 11). We therefore have to esti- 
mate the Besov norm B^{LT-{fl)). Since 1 < p < 2, the embedding 
B^-\Lp{n)) ^ W^-\n) holds, cf. e.g. [89, 2.3.2, 2.5.6]. Hence our right-hand 
side / is contained in the Sobolcv space Wp^^{Q). Therefore we may employ the 
fact that u can be decomposed into a regular part ur and a singular part us, i.e., 
u — ur + Us, where ur e Wp~^^{Lp{fl)) and us only depends on the shape of the 
domain and can be computed explicitly, cf. Grisvard [49, Thm. 2.4.3]. We introduce 
polar coordinates (r/, 9i) in the vicinity of each vertex and introduce the functions 



O(n)?^^'" sm{m7r6i/LJi) if A;.^ := mTr/uJi ^ integer , 



Si,m{ri,9i) :-- 



(i{ri)r^''"^[logrisin{m7i6i/uJi) + 9icos{'m7i9i/uJi)] otherwise. 

Here (i, . . . ,(n denote suitable C°° truncation functions and m is a natural number. 
Then for / e Wp~^{fl), one has 

N 

Us = ^ ^ Q,m ^l.m , 

1=1 0<\l^rn<k+l-2/p 
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provided that no Xi^m is equal to k + 1 — 2/p. This means that the finite number of 
singularity functions that is needed depends on the scale of spaces we are interested 
in, i.e., on the smoothness parameter k. According to (87), we have to estimate the 
Besov regularity of both, us and ur, in the specific scale 

where ^ = ^ + 
Since ur G Wp~^^{n), the boundedness of fl implies the embedding 
W^+\n)-^B'^+'-\L,{n)), with 5>0, 0<q<p, A; + l>2(^-^). 

Hence 
(89) 

Ur e B';+^-\L^{n)), with - = ^ ^ + - for arbitrarily smaU 6>0. 

T 2 2 

Moreover, it has been shown in [16] (see also Remark 31) that the functions Si^m 
defined above satisfy 

(90) Si,miri, di) e for all < q < oo . 

By combining (89) and (90) we see that 

ue B'^+^-\Lr{n)), where i ^ ~ ^ ^ for arbitrarily small S > 0. 

T 2 2 

To derive an estimate uniformly with respect to the unit ball in Bp~^{Lp[Q)) we 
argue as follows. We put 

J\f :^spein\^Si,m{ri,9i) : < Xm,i < k + 1 - 2/p , 1^1, ... ,Ar}. 

Let 7; be the trace operator with respect to the segment F;. Grisvard has shown 
that A maps 

H:=\^ue W^+\Q) : jiu = , I = 1, . . . , + M 

onto Wp~^{il), cf. [48, Thm. 5.1.3.5]. This mapping is also injective, see 
[48, Lemma 4.4.3.1, Rem. 5.1.3.6]. We equip the space H with the norm 

TV 

II := II 'Wi? + 'Ws Hi? = II 'Wr + ^ ^ |Q,m| , 

1=1 0<A,,„<fc+l-2/p 

see (88). Then H becomes a Banach space. Furthermore, A : H ^ Wp^^{Q) is 
continuous. Banach's continuous inverse theorem implies that the solution operator 
is continuous, considered as a mapping from Wp~^{fl) onto H. Finally, observe that 

N 

II Ur + Us ||sfe+i-«(L^(f2)) < C (^\\ Ur ||iyfe+i(n) + Yl Y l^'.'^l) 

1=1 0<A(,^<fe+l-2/p 
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with some constant C independent of u. 



□ 



5 Appendix — Besov spaces 

Here we collect some properties of Besov spaces that have been used in the text 
before. Detailed references will be given. For general information on Besov spaces, 
we refer to the monographs [62, 63, 69, 74, 89, 90]. 



5.1 Besov Spaces on Mr and Differences 

Nowadays Besov spaces are widely used in several branches of mathematics. Prob- 
ably the most common way to introduce these classes makes use of differences. For 
M e N, /i e M'^, and / : M'^ ^ C we define 

M 



0=0 

Let < p < oo. The corresponding modulus of smoothness is then given by 



^^(i,/)p:=sup ||A;r7l|L,(M^), t>Q. 

\h\<t 

One approach to introduce Besov spaces is the following. 

Definition 4. Let s > and < p, q < oo. Let M be a natural number satisfying 
M > s. Then A^{Lp{M.'^)) is the collection of all functions f e Lp{W^) such that 



1/9 

< OO 



if q < oo and 

I / \A^^{L^m) ■= sup t~' UJ^{t, f)p < OO 

if q — oo. These classes are equipped with a quasi-norm by taking 

II / l|A|(Lj,(Mrf)) '-^ II / ||Lj,(Md) + I / |A|(Lp(]R'i)) ■ 

Remark 30. It turns out that these classes do not depend on M, cf. [35]. 

Remark 31. Let g G C(j^(M"') be a function such that q{0) ^ 0. By means of the 
above definition it is not complicated to show that a function 

fa{x):^\x\°'g{x), xeW^, a>0, 
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belongs to A^'^''^(Lp(]R'^)) and that this is best the possible (if a is not an even 
natural number), of. [74, 2.3.1] for details. A minor modification shows that 

faA^) - l^l"" i^og\x\f gix) , xeR^ a,P>0, 

belongs to A'^'^^'''' {Lp{W^)) for alle, <e <a + d/p. 

5.2 Besov Spaces on M.^ and Littlewood-Paley Characteriza- 
tions 

Since we are using also spaces with negative smoothness s < and/or p, g < 1 we 
shall give a further definition, which relies on Fourier analysis. We use it here for 
introductory purposes. This approach makes use of smooth dyadic decompositions 
of unity. Let ip G C^(M'^) be a function such that i^(a;) = 1 if |x| < 1 and ip{x) =0 
if \x\ > 2. Then we put 

(91) ^o{x) := ^(x), ^j(x) := ^(2-^x) - ^(2-^+^x) , jeN. 

It follows 

oo 

5^<^,(x) = l, xeR^ 

j=0 

and 

supp ipj C ja; e M'' : 2^-^ < \x\ < 2^+^| , j = 1, 2, . . . . 

Let and denote the Fourier transform and its inverse, both defined on S'{W'). 
For / e .S'(R'^) we consider the sequence J^~^[(pj{^) J^f{^)]{x), j e No, of entire 
analytic functions. By means of these functions, we define the Besov classes. 

Definition 5. Let s e M and < p, g < oo. Then B^(Lp{R^)) is the collection of 
all tempered distributions f such that 



1/5 

< OO 



if q < oo and 

II mM^'m = sup ^/(0]( • ) MR')\\ < oo 

j=0,l,... 

ifq^oo. 

Remark 32. i) If no confusion is possible we drop M.'^ in notations. 
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ii) These classes are quasi- Banach spaces. They do not depend on the chosen 
function ip (up to equivalent quasi-norms) . If t = min(l,p, g), then 

\\f+9 mL,)\\' < II / mL,)\\' + II g mL,w 

holds for all f,g& Bg{Lp). 

Proposition 4. [89, 2.5.12]. Let < p,q < oo and s > d max(0, 1/p — 1). Then we 
have coincidence o/A^(Lp) and B^{Lp) in the sense of equivalent quasi-norms. 

Remark 33. i) For s < d max(0, l/p — 1) we have A^(Lp) ^ B^{Lp). E.g., the 
Dirac distribution 5 belongs to B^^^^~^\Lp), cf. [74, 2.3.1]. 

ii) Smooth cut-off functions are pointwise multipliers for all Besov spaces. More 
exactly, let ip eV. Then the product ip f belongs to B^{Lp) for any f e Bg{Lp) 
and there exists a constant c such that 

U f \B^^{Lp)\\ < c\\ f \B^^{Lp)\\ 

holds, see e.g. [89, 2.8], [74, 4.7]. 

5.3 Wavelet Characterizations 

For the construction of biorthogonal wavelet bases as considered below, we refer to 
the recent monograph of Cohen [12, Chapt. 2]. Let </? be a compactly supported 
scahng function of sufficiently high regularity and let ipi, where i = 1, . . . 2^* — 1, be 
the corresponding wavelets. More exactly, we suppose for some N > and r e N 

supp v? , supp ipi C [-A^, A^]'^ , i = 1, . . . , 2"^ - 1 , 
ip,A^C'-{R''), z = l,... ,2-^-1, 

J x'^ ipi{x) dx ^ for all |Q;|<r, i = 1, . . . , 2*^ - 1 , 

and 

(p{x - k), 2^'^/2 ^.(2^x -k), J e No , A; e 
is a Riesz basis in L2{W'). We shall use the standard abbreviations 

^i,j,k{x) = 2-''^/2 ^.(2Jx - k) and (pk{x) = (p{x - k) . 
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Further, the dual Riesz basis should fulfill the same requirements, i.e., there exist 
functions ^ and ipi, i = 1, . . . ,2'^ — 1, such that 

{(fik, <fe) = Sk,i (Kronecker symbol) , 

{'4'i,j,k,'4'u,v,e) = Si^uSj^v Sk,£ , 

supp ^ , supp ■ijji C [-A^, A^]'' , i = 1, . . . , 2'' - 1 , 

Jx'^i^i{x)dx = for all \a\ < r , i = 1, . . . , 2'' - 1 . 
For / e S'{R'^) we put 



(92) (/, A,j,k) = fii^i,j,k) and (/, ipk) = /(<^jfc) , 
whenever this makes sense. 

Proposition 5. Let s e R and < p,q < oo. Suppose 

. . / 2d d \ 

(93) r > max [s, 1" 7: ~ ■ 

\ p 2 / 

Then Bg{Lp) is the collection of all tempered distributions f such that f is repre- 
sentable as 

f = ^ak(Pk+^ XlXl Hj,ki'i,j,k {convergence m S') 

i=l j=0 



with 

ifcezd ^ i=i j=o feezd ^ 

if q < 00 and 

^ J ' ?■— 1 2<*-l 7=0,... ^ ^ ' 

r/ie representation is unique and 

hold. Further I : f 1— > {{f,<fk), {f ■ii^i,j,k)} o''^ isomorphic map of B^^LpiW'')) onto 
the sequence space equipped with the quasi-norm || • |5|(Lp)||*, i.e., || • |S|(Lp)||* 
may serve as an equivalent quasi-norm on B^{Lp). 
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Remark 34. i) The restriction (93) guarantees that (92) makes sense for all 

ii) It is immediate from this proposition that the functions (pk,'ipi,j,k,k £ Z^, 1 < 
i < 2'^ — l,j e No form a basis for B^{Lp) if max{p,q) < oo. By the same 
reasoning the functions 

form a Riesz basis for S|(L2). 

Hi) If the wavelet basis is orthonormal (in L2), then this proposition is proved in 
Triebel [92]. But the comm,ents made in Subsection 3.4 of the quoted paper 
make clear that this extends to the situation considered in Proposition 5. A 
different proof, but restricted to s > d{l/p — 1)+, is given in [12, Thm. 3.7.7]. 
However, there are many forerunners with some restrictions concerning s,p 
and q. We refer to [6] and [62]. 

5.4 Besov Spaces on Domains — the Approach via Restric- 
tions 

There are at least two different approacfies to define function spaces on domains. 
One approacli uses restrictions to O, of functions defined on R'^. So, all calculations 
are done on R*^. The other approach introduces theses spaces by means of local 
quantities defined only in Q. For numerical purposes the second approach is more 
promising whereas for analytic investigations the first one looks more elegant. Here 
we discuss both, since both were used. 

Let Q C R*^ be an bounded open nonempty set. Then we define B^{Lp{fl)) to 
be the collection of all distributions / e T>'{Q) such that there exists a tempered 
distribution g E B^{Lp{W'')) satisfying 

f{^)=g{^) for all (^eP(O), 

i.e. g\a = / in V'{n). We put 

\\f\B^,{Lp{nm :=inf || ^ |S^^(L,(R<^))|| , 

where the infimum is taken with respect to all distributions g as above. 

Let diam Q be the diameter of the set ^2 and let be a point with the property 

O. G \y : — y| < diamil > . 
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Such a point we shall call a center of Vt. Since smooth cut-off functions are pointwise 
multipliers, cf. Remark 33, we can associate with any / e B^[Lp{yt)) a tempered 
distribution g e B^{Lp) such that 5f|n = / in V'iVt), 

(94) C\\g\Bl{Lp)\\ < \\ f \Bl{Lp{m\ < W 9 \Bl{Lp)\\ 

(95) supp g C {x e R*^ : \x-x^\<2 diam Q} . 

Here < C < 1 does not depend on / (but on q). 

Now we turn to decompositions by means of wavelets. We use the notation from the 
preceeding subsection. Define 

(96) := |A; e Z'^ : |A;i-a;°| < 2^' diam 1] + A^ , i = 1, . . . , , j = 0, 1, . . . . 
Then given / and taking g as above, we find that 

2'*-! oo 

(97) g= ^{9,'Pk)'Pk+Yl X] X] ^3, A,j,k) A,j,k (convergence in S') 



and 

(98) |U|S^lLp)|| X [Y.\^gMr)"' + 



fcsAo 

2'*-! oo / \ 1/9 

EE2"""'"^"'"(Eite.v;«,.>r) ) 



< CXD . 



=1 j=o fceAj- 
The following more handy notation is also used. We put 

(99) V_i Ao 

(100) Vj := {(i,A;): 1 < i < 2"^ - 1 , A; G A^ j , j = 0, 1, . . . , 

V'j.A i'i,j,k, if A = (i, A;) e V^-, j G Nq, and ipj^x := </^fe if A = A; e V_i. For the dual 
basis, (97) and (98) read as 

oo 

(101) 9 =^^{9,i>3,\}'^3,\ (convergence in S') 

j=-i xeVj 

and 

1/9 



(oo , \ i/q 

i=-i AeVw ^ 



oo 
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5.5 Lipschitz Domains, Embeddings, and Interpolation 

We call a domain D, a special Lipschitz domain (see Stein [77]), if Q is an open set 
in W'' and if there exists a function uj : R'^"^ M such that 

Q = \^{x',Xd) e M"^ : Xd> a;(a;')} 

and 

\uj{x')-uj{y')\<C\x' -y'\ for all x',y'eR'^~\ 

and some constant C > 0. We call a domain Q a bounded Lipschitz domain if Q is 
bounded and its boundary dQ can be covered by a finite number of open balls B^, 
so that, possibly after a proper rotation, dfl fl for each A; is a part of the graph 
of a Lipschitz function. 

Proposition 6. Let fl E M'^ be a bounded Lipschitz domain with center x^ . Then 
there exists a universal bounded linear extension operator £ for all values of s,p, 
and q, i.e., 

{Sf)\n^f for all f e B'^{L,{n)) , 

and 

\\£ : B^^{L,{n))^B^^{L,{R'))\\<oo. 
In addition we may assume 

(103) supp £f g{x eW^ : \x-x°\<2 diam Q} . 

Remark 35. Proposition 6 has been proved by Rychkov [75]. Property (103) follows 
from Remark 33. 

Let us now discuss some embedding properties of Besov spaces that are needed 
for our purposes. 

Proposition 7. Let Vt dW^ be an bounded open set. Let < po,pi, qo, qi < oo and 
let s,t eM.. Then the embedding 

I : B^^:\L,,in)) ^ B^jL,,{n)) 

is compact if and only if 

(104) t>d(---) . 

RemEirk 36. Sufficiency is proved e.g. in [43]. The necessity of the given restrictions 
is almost obvious, but see Lemma 4 and [57] for details. 
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Sometimes Besov spaces or Sobolev spaces of fractional order are introduced by 
means of interpolation (real and/or complex). Here we state following, cf. [91]. As 
usual, ( ■ , ■)e,q and [ • , • je denote the real and the complex interpolation functor, 
respectively. 

Proposition 8. Let ^ be a bounded Lipschitz domain. Let < qo,qi < oo and let 
so,si e R. Let < e < 1. 

(i) Let < p,q < oo. Suppose Sq ^ S\ and put s = (1 — O) Sq + © ^i. Then 



B^q°{Lp{n)),B^l{Lp{n))] = B^{Lp{Q,)) (equivalent quasi-norms) . 

/ e,q 

(ii) Let < po,Pi < oo. We put s — {1 — Q) Sq + Q Si, 

1 1-0 , 1 1-0 
- = \ and - = 1 . 

p Po Pi q qo qi 

Then 

Bg'^{Lp^{il)), B-^l{Lp^{il)) ^ = B-^{Lp{n)) (equivalent quasi-norms) . 

5.6 Besov Spaces on Domains — Intrinsic Descriptions 

For M e N, /i e W^, and / : E'^ ^ C we define 

M 

^M,, ^ J EC0(-ir-^7(^ + j/i) if x,x + h,...,x + Mhen, 

otherwise . 

The corresponding modulus of smoothness is then given by 

u;^(t,f)p:^ sup ||Af/|U^(n), t>0. 

\h\<t 

The approach by differences coincides with that using restrictions as can be seen by 
the recent result of Dispa [37] . 

Proposition 9. Let Q be a bounded Lipschitz domain. Let M e N. Let < p,q < oo 
and d max(0, 1/p — 1) < s < M. Then 

B-^{Lp{n)) = |/eL^a.(p,i)(^^): 

[t-^u''it,f)p\'^y\oo^ 



□ ._ 



in the sense of equivalent quasi-norms. 
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5.7 Sobolev Spaces on Domains 

Let Q be a bounded Lipschitz domain. Let m e N. As usual H'^{Q) denotes the 
collection of all functions / such that the distributional derivatives D"/ of order 
|q;| < m belong to -L2(fi). The norm is defined as 

|Q|<m 

It is well-known that H"'(E.'^) = 5^(L2(M'^)) in the sense of equivalent norms, cf. 
e.g. [89]. As a consequence of the existence of a bounded linear extension operator 
for Sobolev spaces on bounded Lipschitz domains, cf. [77, p. 181], it follows that 

H'^in) = B^{L2{n)) (equivalent norms) 

for such domains. For fractional s > we introduce the classes by complex interpo- 
lation. Let < s < m, s ^ N. Then, following [59, 9.1], we define 



e = 1- 



J e m 

This definition does not depend on m in the sense of equivalent norms. This follows 
immediately from 

s 







H'"(n),L2{n) = B^{L2{n)),Bl{L2{n)) =s|(L2(Q)), e = i- 



J 



m 



(all in the sense of equivalent norms), cf. Proposition 8. 



5.8 Function Spaces on Domains and Boundary Conditions 

We concentrate on homogeneous boundary conditions. Here it makes sense to intro- 
duce two further scales of function spaces (distribution spaces) . 

Definition 6. Let n <zW^ he an open nontrivial set. Let s e R and < p, g < oo. 

(i) Then Bg{Lp{ft)) denotes the closure of T>{Q) in B^{Lp{Q)), equipped with the 
quasi-norm of B^{Lp{fl)) . 

(ii) Let s > 0. Then i?o(^) denotes the closure ofT>{Q) in H^{fl), equipped with the 
norm of H'^{Q). 

(iii) By B^{Lp{Q)) we denote the collection of all f e V'{fl) such that there is a 
g e B'q{Lp{R'^)) with 

(105) g^^ — f and suppg^ C , 

equipped with the quasi-norm 

\\f\B^,{Lp{m\-i^i\\9\B',{Lp{R'm, 
where the infimum is taken over all such distributions g as in (105). 
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Remark 37. For a hounded Lipschitz domain B^{Lp{Q)) = B^{Lp{Q)) = B^{Lp{Q)) 
holds if 

< p,q < oo , max ( d ( 1)) < s < - , 

\p \p / / p 

cf. [48, Cor. 1.4.4.5] and [91]. Hence, 
ifO<s< 1/2. 

Often it is more convenient to work with a scale Bg{Lp{Q)), originally introduced 
in [91]. 

Definition 7. Let D, gW^ be an open nontrivial set. Let s e M and < p, q < oo. 
Then we put 



BliLpiQ)) 



BJ{Lp{n) if s<IIp. 

Bl{Lp{n)) if s>l/p. 



This scale 5*(Lp(r2)) is well-behaved under interpolation and duality, cf. [91]. 

Proposition 10. Let he a hounded Lipschitz domain. Let 1 < p,Po,Pi, Q, qo, qi < 
oo and let sq, si e R. Let < © < 1. 

(i) Suppose sq 7^ si and put s — {1 — Q) sq + Q si. Then 

B {Lp{Q)),B {Lp{il))] = B {Lpip)) (equivalent quasi-norms) . 

(ii) We put s = (1 - 0) So + si, 

1 1-0 , 1 1-0 
- = 1 and - = 1 . 

p Po Pi q qo qi 

Then 

i?g°(LpQ(f2)), 5*^(Lpj(f2)) = i?q(Lp(0)) (equivalent quasi-norms) . 

(iii) With s e R and 

-, 11 J . 1 1 
1 = — I and 1 — — I 

p p' q q 

we find 

{Bl{Lp{n))) = B-;{Lp,{n)). 

Here the duality must he understood in the framework of the dual pairing {T>{Q), T>'{Q). 
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5.9 Sobolev Spaces with Negative Smoothness 
Definition 8. For s > we define 

(-^o(^)) « - I 7^ integer, 



m)' 



otherwise . 



RemEirk 38. Let D, gM.^ be a bounded Lipschitz domain. Then 

H^{n) = B^{L2{n)) , s>0, s-^^ integer, 

of. [48, Cor. 1.4.4.5] and Proposition 9. From Remark 37 and Proposition 10 we 
conclude the identity 

(106) H-'{n) = B^'{L2{n)), s>0, 

to be understood in the sense of equivalent norms. 

,.s = 1 — 1/p 



s = 1/p 
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Remark 39. [88, 4.3.2]. Let Q be a bounded open set with a smooth boundary. Then 
B',{Lp{n)) = Bl{Lp{Q)) holds if 

1 1 , 

1 < p,q < oo , l<s<oo, s / integer. 

P P 

5.10 Wavelet Characterization of Besov Spaces on Domains 

It is a difficult task to construct wavelet bases on domains, see [12, 2.12] and the 
references given there. Under certain conditions on the domain Q such constructions 
with properties similar to (101), (102) are known in the literature, see Remark 11 

above. 

Let r2 be a bounded open set in M'^. Let p, q and s be fixed such that s > d max(0, 1 /p— 
1). We suppose that there exist sets Vj C {1, 2, . . . , 2*^ - 1} x Z^, with 

(107) 0< inf 2-^'^\Vj\< sup 2"^'^ | V^l < oo , 

j=-ho,- j=-i,o,... 

and functions ipj^x, ipj,\, A G Vj, j = —1, 0, 1, . . ., such that 

(108) suppV'j.A, supp^pj^x C Q , A e Vj- , 

(109) {i^i,j,k, i^u,v,i) = Si,u Sj,v 5k,e , 
and such that / e Bg{Lp{Q)) if and only if 

oo 

(110) f ^Yl XI '^^■'^) (convergence in V) , 

j=-i xeVj 

and 

(111) II / ll*j(Lp(n)) ^ II / llBJ(ip(f^)) ■ 



where 



(112) ll/ll?s(Mn)) - E 2'"*'«-J'"( E K/.*.A)r)" < oo. 
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